Method
Participants
Twenty-six participants (18 females) took part in this experiment, none of whom had taken part in Experiment 1. All the participants were students (age range, 18–24 years), who took part for course credit and had normal vision.
Stimuli, apparatus and design
The 40 encoding stimuli from Experiment 1 were used again here. Each participant saw all the stimuli once, divided into two blocks of 20 trials. Half the stimuli were landscapes, and half were interiors. Two viewing conditions were used: viewing with either a horizontally or vertically oriented elliptical gaze-contingent window (Fig. 6). Viewing condition was blocked within participants, and block order was counterbalanced. Across participants, all the stimuli appeared equally often in both conditions.
The gaze-contingent display functioned in the same way as in Experiment 1. However, rather than a completely masked background, a low-pass-filtered version of the current stimulus appeared outside the window of fixation. The low-pass versions were produced in Adobe Photoshop by convolution with a Gaussian blur filter, the standard deviation of which was 0.5°. This relatively severe level of blur attenuated spatial frequencies greater than approximately 2 cycles/deg, frequencies which are well above perceptual thresholds, even at large eccentricities. This level of blur was chosen, on the basis of pilot studies, to be noticeable to participants, while still enabling general scene content to be determined from the blurred image (for manipulations of the degree of peripheral blur necessary for participants to become aware of the manipulation, see Loschky et al., 2005). The window around fixation was a regular ellipse with major axes of 12.5° and 3.1° (or vice versa for the vertical window), which were the same as the dimensions of the rectangular windows used previously.
Procedure
We again used the picture–sentence verification task from Experiment 1. Participants saw all images in a random order, with the instructions that they should “look carefully at the pictures so as to verify the accuracy of the following sentence”.
The procedure began with instructions, and the eyetracker was calibrated as previously. Two practice trials were then given, in order to familiarize participants with the apparatus and task. The experiment proper then began, and each trial proceeded in exactly the same way as in Experiment 1. No subsequent memory test was given.
Analysis and Results
Eye movements
We look first at general eye movement parameters before examining the saccade direction and amplitude distributions. Our comparisons of interest were (1) whether the orientation of the window (horizontal or vertical) affected viewing and (2) whether the effect in these blurred conditions was the same as with the rectangular, masked windows in Experiment 1. If the effects from the first experiment were caused by a reluctance to orient to a blank mask, the results with a blurred periphery should be more similar to normal viewing and should show less of an effect of window shape.
General eye movement measures
The mean number of fixations per trial, fixation duration and saccade amplitude were subjected to a 2 × 2 mixed ANOVA with the within-subjects factor of window shape (horizontal or vertical) and the between-subjects factor of experiment. For both number and duration of fixations, there were no differences between conditions or experiments and no interactions, all Fs < 2.3, ps > .14. Moreover, independent t tests yielded no differences in these measures between either condition in Experiment 2 and normal viewing in Experiment 1, both ts(40) < 1.
There was no main effect of experiment on saccade amplitude, F(1, 40) < 1: Masking and blurring led to saccades of a similar mean length. Although there was an interaction between experiment and window shape, F(1, 40) = 6.3, p < .05, saccades in Experiment 2 remained longer with a horizontal ellipse window (M = 4.4°, SE = 0.21) than with a vertical window (M = 4.0°, SE = 0.19), t(25) = 3.5, p = .001. This is the same difference as that seen in Experiment 1, although it was not as severe. In both conditions, saccades remained shorter than those seen in normal viewing, both ts(40) > 7, both ps < .001. This is consistent with the findings from Experiment 1: Gaze-contingent windows led to shorter saccades.
Thus, the window manipulation in this experiment produced results similar to those in Experiment 1, although the number and duration of fixations was relatively less affected (as compared with normal viewing) than in the fully masked conditions in Experiment 1.
Saccade direction
The pattern of saccade directions was very similar to that in Experiment 1: Most saccades moved left or right, but in the vertical condition, the eyes also moved up and down quite frequently (see Fig. 7).
The mean HVP was .71 (SE = .01) in the horizontal condition and .59 (SE = .01) in the vertical condition. There were more vertical saccades and fewer horizontal ones when the elliptical window was vertically oriented, t(25) = 8.8, p < .001. The vertical condition was reliably different from normal viewing, t(40) = 7.0, p < .001, but the horizontal condition did not differ reliably, t(40) < 1. There was no evidence that the pattern was any less pronounced than in Experiment 1 (no effect of experiment and no interaction; both Fs(1, 40) < 1).
Saccade amplitude
Figure 8 shows the distribution of saccade amplitudes for horizontal and vertical saccades in the two window shapes. We compared the participant medians (in order to account for the skew in the data), computing an ANOVA with 2 saccade directions ×2 window shapes, and with an additional between-subjects factor of experiment (to compare blurring and masking).
Overall, saccades were marginally shorter with the blurred display than in Experiment 1 (Experiment 1, M = 3.6°; Experiment 2, M = 3.2°), F(1, 40) = 3.6, p = .065. Main effects of direction, F(1, 40) = 32.3, p < .001, and window shape, F(1, 40) = 27.8, p < .001, and an interaction between them, F(1, 40) = 368.8, p < .001, confirmed the pattern in Experiment 1: With a horizontal window, horizontal saccades peaked at higher amplitudes than did vertical saccades (Ms = 4.2° vs. 2.9°), but with a vertical window, the pattern was reversed and vertical eye movements were longer (Ms = 3.6° vs. 3.1°). There was also an interaction between experiment and direction, F(1, 40) = 4.4, p = .04. Horizontal saccades had a higher median than did vertical saccades, but this difference was less pronounced in Experiment 2 (Ms = 3.3° and 3.1° for horizontal and vertical saccades, respectively) than in Experiment 1 (Ms = 3.9° and 3.3°). Finally, there was a three-way interaction, suggesting that the shift in the amplitude of saccades of different directions elicited by different windows was different in Experiment 2, F(1, 40) = 15.1, p < .001. However, although the differences were less pronounced in Experiment 2, breaking down the effects within this experiment showed the same result: Direction interacted with window shape, F(1, 25) = 144.2, p < .001. With a horizontal ellipse window, horizontal saccades (M = 3.8°) had larger median amplitude than did vertical saccades (M = 2.8°). With a vertical ellipse, the opposite was true (horizontal, M = 2.8°; vertical, M = 3.3°).
How often did saccades move outside the window of preserved visibility? Saccades were just as likely to move outside the window in Experiment 2 as in Experiment 1, t(40) < 1. As before, the direction of these saccades followed the orientation of the window (see also the dashed lines in Fig. 8) With a horizontal window, more vertical than horizontal saccades landed outside the window (69% vs. 37%), and with a vertical window, the opposite was true (39% of vertical saccades vs. 75% of horizontal saccades). Thus, blurring the peripheral information, as opposed to masking it completely, did not seem to affect the deployment of saccades outside the window. How were these eye movements controlled, given that the features at their landing site were invisible or degraded? The next section looks at the properties of these saccades in more detail.
The control of saccades beyond the window
We performed additional analyses of all the eye movements from the horizontal and vertical conditions (since these were the most comparable between experiments). First, we checked whether the trends in saccade direction held for the saccades outside the window. All of these eye movements were targeted at masked or blurred regions. If they also followed the direction of the window, it would indicate control based on memory or expectations of what was there (in the case of a masked periphery) or on limited, low-spatial-frequency information (in Experiment 2).
It was most appropriate for this analysis to compare saccades of similar amplitude, so we looked at all saccades longer than 6.5°, placing their endpoints beyond the window in all conditions from both experiments (Experiment 1, N = 2,304 saccades; Experiment 2, N = 6,313). Window shape continued to have an effect on the direction of these long saccades, F(1, 40) = 40.5, p < .001. However, this interacted with experiment, F(1, 40) = 24.6, p < .001. In Experiment 1, even large saccades were more likely to be horizontal with a horizontal window (mean HVP = .84) than with a vertical window (mean HVP = .52), paired t test, t(15) = 5.1, p < .001. This same trend was reduced in Experiment 2 (horizontal window, HVP = .71; vertical window, HVP = .67), although it did reach one-tailed significance, t(25) = 1.77, p = .04. Thus, the differences in saccade direction in Experiment 1 were also found in large saccades that were targeted at locations outside the window, even though there was no difference in the information at these points. However, when peripheral information was blurred, window shape had less of an effect on large saccades. On those occasions in which gaze moved outside the window, an increase in the information in the periphery ameliorated the affect of the gaze-contingent window on saccade direction.
We also analysed the properties of all eye movements landing outside the window, in order to test two specific predictions about the control of these saccades. First, it might take longer to initiate a saccade to these locations, perhaps because masking or blurring reduces the saliency of points beyond the window boundary. To test this, we looked at the duration of the fixation preceding the eye movement: Systematically longer fixation durations would suggest an increased preparation time for these saccades. Second, given that the saccades landed on targets that were masked or blurred, they should not be ideally positioned and might lead to a corrective eye movement. For example, people rarely fixate an empty background, but they may have erroneously done so if these regions were masked. If this happened often, participants may have terminated the resulting fixation early and made a short saccade to a more optimal position. We therefore computed the duration of the following fixation and the amplitude of the following saccade, and we predicted shorter fixations and smaller saccades after saccades landing outside the window. In each case, we compared the average for saccades landing within the window with that for saccades landing outside, in masked viewing (Experiment 1) and blurred viewing (Experiment 2), collapsed across horizontal and vertical window shapes.
Saccades outside the window were not associated with reliably longer prior fixation durations in either Experiment 1 (outside, M = 245 ms; inside, M = 249 ms) or Experiment 2 (outside, M = 249 ms; inside, M = 250 ms), both ts < 1. However, in Experiment 1, the fixation following a saccade outside the window (M = 232 ms) was reliably shorter than one directed within the window (M = 257 ms), t(15) = 6.5, p < .001. The same trend was also reliable in Experiment 2 (outside, M = 242 ms; inside, M = 251 ms), t(25) = 2.3, p < .05. The median amplitude of the saccade following an eye movement outside of the window was slightly shorter than that following an eye movement within the window in both experiments. This difference was negligible in Experiment 1 (outside, M = 3.65°; inside, M = 3.69°), t(15) < 1, but reached significance in Experiment 2 (outside, M = 3.13°; inside, M = 3.36°), t(25) = 3.2, p < .005. To summarize these results, whether a saccade moved inside or outside the window made no difference to the previous fixation duration. However, in line with our predictions, the subsequent fixation was shorter in duration and the following saccade had a smaller amplitude if an eye movement went outside the window.
The effect of scene type
We previously reported a reduced horizontal bias and more vertical saccades when participants viewed interior scenes than when they viewed landscapes (Foulsham et al., 2008). In the present research, the windowed conditions reduced the availability of peripheral scene content. Was saccade direction in these conditions sensitive to the type of scene? If the change in eye movements in interiors occurs because scene type is recognized from peripheral information, we would expect similar scanning in both landscapes and interiors in the moving-window conditions, particularly in Experiment 1, where the background was completely masked. Looking at the interaction between window shape and scene type will also help us explore the relationship of the windowed conditions to normal scene viewing.
Figure 9 shows the HVP calculated separately for the 20 images that were landscapes and the 20 interior scenes. There were reliable effects of scene type [Experiment 1, F(1, 15) = 11.4, p < .005; Experiment 2, F(1, 25) = 67.3, p < .001], with more horizontal saccades in landscapes than in interiors. Furthermore, there was no interaction with window condition [Experiment 1, F(3, 45) < 1; Experiment 2, F(1, 25) = 1.7, p = .2] and no scene type × experiment interaction, F(1, 40) = 2.3, p = .14. Thus, the sensitivity of eye movements to the type of scene does not depend on the availability of peripheral information.
Distribution of saccade endpoints
A residual question from our experiments concerns the location of fixations around the scene. This is important for two reasons. First, there are several biases known to affect the overall spatial distribution of fixations in an image, such as a central bias, and it is interesting to ask whether the gaze-contingent window modified these biases. Second, Najemnik and Geisler (2008) showed that both human observers and an ideal observer model tended to fixate in a “donut”-shaped region around the centre of a search display and, particularly, at the top and bottom of this ring. This pattern complemented their analyses of saccade amplitudes and direction: Although horizontal saccades were more likely in their study, they suggested that in order to maximize information, initial, infrequent vertical saccades moved fixation towards the top or bottom, which was then explored with more frequent but shorter horizontal saccades. In sum, fixations were most common at the top and bottom of the display, which, according to these authors, indicated optimal positioning of the horizontally elongated region of visibility. Thus, looking at the fixation distribution is a key way in which to distinguish between alternative interpretations of our own data.
We plotted the spatial distribution of saccade endpoints as a function of condition (columns in Fig. 10). To compare normal viewing with windows of a different shape, we will present this for the normal and square viewing conditions from Experiment 1, alongside the horizontal and vertical conditions from Experiment 2 (the distributions were similar in the asymmetrical window conditions from the first experiment). In each case, plots were generated by cumulatively adding a 2-D Gaussian patch (σ = 1°) at the location of the endpoint of each saccade. High values, represented by warm colours in the map, indicate the locations where fixations were most common. To explore trends over time, the rows in Fig. 10 split the data by saccade index within each trial (saccades 1–5, 6–10 and so on). Saccades after the 20th saccade were not included in this analysis, because some participants did not make that many saccades in all the conditions and, so, there were too few data points.
The plots reveal several interesting trends. First, there is a strong central bias in all the conditions, which occurs in the first few saccades but becomes less pronounced in later saccades. Second, the most frequently inspected points are around the image horizontal, whereas the top and bottom of the image are more likely to be neglected. Third, in general, there was a leftward bias, particularly in the first five saccades, where 69% of all the saccades landed in the left side of the image. Finally, the moving-window conditions resulted in some differences in the distribution of saccade endpoints, particularly when one looks at the 6th–10th saccade in the trial. In both the square window and vertical window conditions, there was a strong asymmetry in the plots: Saccades were more likely to move to the left of the image than to the right. This contrasts with the distribution seen in normal viewing, which is more evenly distributed, and that in the horizontal window condition, which actually seemed to produce a rightward bias in saccades 6–15.
This experiment replicated the main finding from Experiment 1—that a vertically oriented window reliably reduced the horizontal bias—and extended that experiment in several ways. First, the effect remained in stimuli where peripheral information was blurred, rather than being completely removed, which is important because this is similar to the way that information is disrupted in natural vision. Second, the effect remained for large saccades that moved outside the window. Third, saccades outside the window were followed by shorter fixations and smaller saccades, demonstrating that decreased information at the saccade destination affected subsequent eye movements. Fourth, the gist of the scene moderated the pattern of saccade direction, even when peripheral information was masked, such that interiors led to fewer horizontal saccades and more vertical ones. Finally, although there was evidence for a central bias in the distribution of fixations, the concentration of fixations at the top and bottom of the display that was reported by Najemnik and Geisler (2008) was not seen in the task and stimuli used here.
General discussion
We investigated how the shape of the information around fixation affected some of the patterns in eye movement scanning during an unconstrained encoding task. We will begin by characterising normal scanning, before discussing the effect of different gaze-contingent windows and the implications for models of eye guidance in scenes.
Normal and gaze-contingent scanning
In Experiment 1, we replicated some of the eye movement biases that have been seen in other image-viewing tasks. There was a strong central bias in normal viewing, and this was strongest at the start of scene viewing (during the first five saccades). This is likely because the starting viewing position was constrained by the experiment to be at the centre of the screen and, presumably, as time went on, people were more likely to have moved further from the centre. Other factors that have been suggested to contribute to the central bias are the distribution of salient features or objects in the scene (photographer bias) or orbital reserve, and Tseng, Carmi, Cameron, Munoz, and Itti (2009) and Tatler (2007) have considered these factors in detail. There was also a slight leftward bias at the start of viewing, which is consistent with the results of Dickinson and Intraub (2009), who recently reported a leftward asymmetry in scene perception. The preference to move to the left side of the image was found across a range of scenes and, therefore, seems unlikely to be caused by an uneven distribution of features or objects within the scene (for further discussion of the role of image features in saccade asymmetries, see Foulsham & Kingstone, 2010).
The saccades made in normal viewing also showed biases in direction and amplitude. There was a marked tendency for making horizontal saccades, rather than vertical or oblique eye movements. Most saccades were between about 2° and 7° in amplitude, indicating that they tended to target regions on the parafovea or extending into the periphery, but horizontal saccades were longer than vertical saccades, on average. Why were there more (and longer) horizontal saccades? The bias in the present study was probably exacerbated by the fact that images (and the visible monitor) were landscape in orientation, that scanning started in the centre, and that the image was always presented in the same egocentric reference frame (meaning that the horizontal position of the eyes and biases in the movement of the extraocular muscles may have had an effect). However, we have shown previously that a horizontal bias persists even in the absence of these cues (Foulsham et al., 2008). With random start locations and square images that were rotated from their canonical orientation, that study demonstrated that the horizontal bias was scene centred, rather than egocentric.
Gaze-contingent windows had some general effects on scanning, some of which have been reported elsewhere. First, the saccades made in these conditions were shorter, on average, than those in normal viewing, confirming the influence of peripheral information on saccade guidance and suggesting that a more conservative strategy was employed that targeted features within the window. This would also explain why the gaze-contingent conditions elicited somewhat less dispersed endpoint distributions and a greater central bias: The window curtailed long saccades, so that fixations remained closer to the centre for longer. Removal of peripheral information also had a detrimental effect on memory for the scenes: It took longer to recognise scenes that had been viewed through a gaze-contingent window, consistent with a detriment in encoding in these conditions (see Saida & Ikeda, 1979).
There was mixed evidence for an effect of peripheral masking on the number or duration of fixations. In Experiment 1, some of the window conditions resulted in more fixations, with a slightly lower average duration, than did those in normal viewing, but this was not found in Experiment 2. van Diepen and d'Ydewalle (2003) also found an increased number of fixations with peripheral masking of line drawings of scenes, although this study reported an increase in fixation durations under these conditions. Loschky and McConkie (2002) also reported longer fixation durations in viewing with a low-pass-filtered periphery, which we did not find here, perhaps because we used larger windows. This discrepancy might also occur because, in our study, viewing was limited by a fixed trial duration. The increased difficulty of the gaze-contingent encoding in Experiment 1 was reflected in an increase in the number of fixations, perhaps because each object or region of interest had to be fixated multiple times. This interpretation is consistent with a sequential model of attention in scene perception where fixation durations reflect processing close to fixation and are relatively unaffected by peripheral information. On the other hand, having low-resolution information in the periphery (Experiment 2) was sufficient for eliciting fixations that were not significantly more frequent or longer than in normal viewing.
The effect of window shape
In the introduction, we offered two possible hypotheses for how a horizontal and vertical window would change patterns in scanning direction. First, if saccades were guided in order to maximize the information gained on each fixation (defined as revealing new areas of the scene), a horizontal window would lead to more vertical saccades in order to avoid previously seen regions, with the opposite being true in the case of a vertical window. Second, if saccades were guided towards features that were currently visible, a horizontal window would lead to more horizontal saccades. Our findings point unanimously to the latter explanation: A vertical window produced more vertical saccades than in the other conditions, even though there were fewer unseen areas to be explored by moving up and down. This difference between window conditions was found even on the very first saccade. The pattern of saccade amplitudes was also systematically related to the dimensions of the window: Saccades with an amplitude and direction matching the boundary of the window were made frequently. We can be confident that these findings are not artefacts of windows with straight edges and a completely masked periphery, because the findings were replicated in Experiment 2 with elliptical apertures and a blurred periphery. In addition, there was no evidence for a trade-off in terms of the horizontal bias and a tendency to fixate at the top and bottom of the display, as was found by Najemnik and Geisler (2008). In fact, the top and bottom of the scene were relatively neglected in all conditions.
Several conclusions can be made on the basis of the saccade direction and amplitude results. First, because a bias for horizontal saccades persisted even with a square window (where visible information was equal in all directions), this bias must be partly driven by experience or knowledge about landscape-oriented images and monitors. Second, this default bias for horizontal eye movements was modified by an asymmetric window, consistent with a strategy of targeting points that could already be seen within the high-resolution window. In the vertical window condition, there were more features above and below fixation, and fewer to the left or right, and so the higher frequency of vertical saccades might reflect a tendency for people to move towards the information within the window.
A potential problem with this interpretation is that there was a significant number of saccades that were large enough to be targeted outside the window. How were these saccades controlled, and how did masking and blurring affect their occurrence? Loschky and McConkie (2002) also found that the radius of a gaze-contingent window shortened saccades, and they interpreted this pattern of results as evidence that peripheral filtering reduces the saliency of points outside the window, making them less likely to win the competition for the next saccade. Surprisingly, in our study, saccades were no more likely to move outside the window when the periphery was blurred than when it was completely masked. This further supports the argument that, when given the choice between high-resolution information and masked or above-threshold filtered information, the eye movement system tends to saccade within the window. Furthermore, even long saccades outside the window tended to go in the direction of the elongated boundary. One possibility is that these saccades target features on or near the window boundary but overshoot this destination, due to noise in the saccadic system. This would predict a distribution of amplitudes with the mode at the radius of the boundary, which is similar to what we find. It is also possible that participants were driven to “follow” partially seen objects or details which extended into the masked space, therefore allowing them to make a reasonable prediction about what was there before they planned their saccade.
Another possibility is suggested by the properties of the preceding and following fixations and of the following saccade. It has been suggested that eye movement events in scene viewing can be divided into local clusters of exploratory fixations of long duration with short saccades, separated by larger amplitude, “global” shifts to a new region (Unema et al., 2005). We found that saccades outside the window were followed by shorter fixations and smaller amplitude saccades than were saccades within the window. This suggests that saccades outside the window may have been qualitatively different, global shifts which moved the eyes from one period of local scanning (within the window) to another. Why, then, was the subsequent fixation atypically brief? It is likely that, because the information at this point was degraded at the start of the saccade, its positioning was suboptimal and, therefore, participants terminated the fixation early and made a small re-adjustive saccade. It may be that viewing with a gaze-contingent window exaggerates the local/global viewing strategy, and this would be worth exploring in further research.
Implications for natural scene viewing
An additional point of interest concerns the relationship between the gaze-contingent conditions and normal viewing. Across several measures, a horizontal window led to less of a difference from normal scanning than did a vertical or square window. Specifically, in Experiment 1, a horizontal window had less of an impact on mean saccade amplitude and on the direction and amplitude distributions than did a vertical or square window. This was also true in Experiment 2, and it suggests that viewing with a horizontal window was less disruptive and more normal for participants. The implication here is that during normal vision, the visible region most important for eye guidance is elongated in the horizontal direction. This is consistent with the visibility maps measured by Najemnik and Geisler (2005), albeit in a rather different task.
Although the results emphasise the importance of currently visible features in our task, we should be cautious about making further claims about human eye movement strategies in natural viewing. Window shape is only one of several factors in determining the pattern of saccade directions, and we also confirmed that scene type makes a difference. More horizontal saccades were made when landscapes were viewed than when interiors were viewed, probably because interesting features were arranged along the horizontal, and this did not interact with window shape. The lack of information-seeking saccades may have occurred because participants did not have experience and knowledge about the visibility of the artificial windows and, so, resorted to a more feature-driven approach. It is highly likely that saccade targeting is based on multiple sources of information that can be weighted differently (see Brouwer & Knill, 2007, for an example of this cue integration in visually guided reaching). Therefore, the challenge for modelling is to combine the drive towards currently visible features and the desire to maximize information in a way that explains the present data. For example, perhaps locations close to the boundary of the window (which were frequently fixated in the present study) represent a trade-off between targeting visible features and revealing new information. This is another interesting avenue for future study.
In conclusion, the experiments reported here point to two important principles regarding the control of saccades in scene viewing. First, rather than aiming only to maximize the information on each saccade, the eye movement system in our encoding task targeted features within the regions of foveal or parafoveal visibility. Second, this region is elongated horizontally, which may be an important factor in the horizontal saccade bias that has been observed in natural scene perception. With this foundation in place, the opportunity for future investigations is vast. In addition to questions of local/global exploration and the modelling of saccade targeting, researchers can use the gaze-contingent window to examine the role of bottom-up and top-down features in search, scanning in different tasks and how changes in window shape might be used to enhance exploration in patient populations, such as those with left-side neglect.