A fundamental property of the visual system is the ability to segregate objects from one another in a visual scene. For instance, when attempting to grab a book from a cluttered desktop, we are able to select and act upon the book with relative ease, despite the presence of other irrelevant objects. This ability to effectively segregate objects in a visual scene is the result of several visual processes, one of which is figure–ground assignment. During figure–ground assignment, candidate objects (“figures”) are segregated from backgrounds, allowing us to process and act upon only a focused subset of the information present in a scene. This process is trivial in situations in which a single object is present on a uniform background; however, in most real-world visual scenes, we are confronted with a number of objects that overlap and partially occlude one another, making figure–ground assignment much more difficult. As a result, sophisticated visual mechanisms must be implemented to effectively select which regions are associated with a particular contour and, consequently, distinguish figures from grounds.

Control of figure–ground assignment

Beginning with the influential work of the Gestalt psychologists, research on figure–ground assignment has focused on the factors that control the segregation of candidate objects from their backgrounds. These cues to figure–ground assignment can be viewed as control settings that determine which regions in a scene are likely to be perceived as figures. Extensive evidence suggests that both bottom-up and top-down factors control figure–ground assignment (see Palmer, 1999, 2002, and Wagemans et al., 2012, for reviews).

Initial studies focused on the bottom-up control of figure–ground assignment. For example, Rubin (1915/1958) noted that area influences figure–ground assignment, in that smaller regions are more likely to be perceived as figure than are larger regions. Likewise, both symmetric regions (i.e., those with matching left and right parts; Bahnsen, 1928) and convex (i.e., outwardly bulging) regions are more likely to be viewed as figure (Kanizsa & Gerbino, 1976; Pomerantz & Kubovy, 1986; Rock, 1975). More recently, Vecera, Vogel, and Woodman (2002) showed that regions falling below a horizon line (i.e., the “lower region”) of a scene are more likely to be perceived as figure than those falling above a horizon line. These stimulus-based cues are important because they are invariant properties of figures that can allow viewers to perform figure–ground assignment even when confronted with novel scenes or scenes containing novel objects. As a result, the stimulus properties themselves are often sufficient to drive figure–ground assignment, affording the visual system the flexibility required to make segregation processes efficient even in situations in which little or no top-down information is available (see Vecera & O’Reilly, 2000).

In addition to the influence of stimulus properties on figure–ground assignment, there is evidence that top-down inputs control figure–ground assignment. For example, object familiarity influences which region of a display is perceived as figure (Peterson, 1994; Peterson & Gibson, 1993; Rock, 1975). When observers view figure–ground stimuli that contain a familiar or meaningful region (e.g., a face in profile), they tend to see that region as figure. However, if that same display is rotated 180°, familiarity affects figure–ground assignment much less, if at all, suggesting that both top-down and image-based properties interact with one another during figure–ground assignment.

Other top-down processes also affect figure–ground assignment. For example, attending to a particular region of a display makes that region more likely to be perceived as figure (Driver & Baylis, 1996; Vecera, Flevaris, & Filapek, 2004). Furthermore, Vecera et al. (2004) provided evidence that when both image-based and top-down cues (convexity and attention, respectively) were present in the same display, these cues competed with one another to influence which region observers would see as figure. This combination of both top-down and bottom-up cues in the control of figure–ground assignment fits both with interactive accounts of figure–ground segregation (Vecera & O’Reilly, 1998, 2000; see also Craft, Schütze, Niebur, & von der Heydt, 2007; Roelfsema, Lamme, Spekreijse, & Bosch, 2002) and with more general accounts of “biased competition” in visual processing (Desimone & Duncan, 1995; see also Vecera, 2000). Figure–ground assignment can thus be viewed as a competition between two regions for figural status, and various cues bias the figural assignment to one region over others.

Consequences of figure–ground assignment

As we outlined above, much is known about the factors that control figure–ground assignment. However, less is known about the possible consequences of this process on later visual processing. Phenomenologically, figures appear to be more shape-like and more salient than grounds, and there is evidence that the visual system processes figures differently from grounds, producing a “figural benefit” in different visual tasks. For instance, as was reported by Rubin (1915/1958), when viewing an ambiguous face/vase stimulus, holding the vase as figure makes it appear more distinct and shape-like than the opposing faces, which lack form and fall into the background (see also Koffka, 1935). Perhaps because figures appear more shape-like, they are more likely to be remembered, both in the long term (several minutes or longer; Rubin, 1915/1958; see also Dutton & Traill, 1933; but see Rock & Kremen, 1957) and in the short term (a few hundred milliseconds; Driver & Baylis, 1996, Vecera et al., 2004). Although figures are processed differently than grounds, the underlying cause of this processing difference is unknown.

An early set of experiments by Wong and Weisstein (1982) examined the influence of figure–ground assignment on target detection. In their experiments, observers viewed Rubin’s ambiguous face/vase stimulus and held either the vase or the faces as figure. Target detection was enhanced when the target fell within the region observers currently perceived as the “figure,” suggesting a perceptual advantage for targets appearing in figure relative to background regions. Similarly, Nelson and Palmer (2007) showed that when observers viewed a figure–ground display in which one region contained a meaningful, or familiar, cue to figure–ground assignment (e.g., a face in profile), targets falling within this region were detected more quickly, likely because perceptual processing of these “figures” began prior to perceptual processing of grounds (Lester, Hecht, & Vecera, 2009; see also Hecht, Spencer, & Vecera, 2015). In addition to this effect of figural status on detection times, Nelson and Palmer also found better discrimination for targets appearing in figures.

One straightforward interpretation of the latter finding is that figure–ground assignment produces a perceptually enhanced representation of the figural region, which then allows target detection and discrimination to occur efficiently for targets on the figure. Perceptual enhancement could be a consequence of enhanced neural representation of figures, relative to ground regions, as has been predicted by neurophysiological studies and models of figure–ground assignment (e.g., Craft et al., 2007; Roelfsema et al., 2002; Vecera, 2000; Vecera & O’Reilly, 1998, 2000). However, from the previous studies outlined above, it is unclear whether the behavioral processing benefits observed for targets appearing on figures are due to perceptual enhancement of the regions assigned as figure or to the attentional effects of figure–ground assignment. In other words, it is possible that these previously reported figural benefits are the result of attention being drawn to figures, rather than benefits related to figure–ground assignment per se. Such a possibility is plausible, because a number of studies have demonstrated perceptual processing benefits for items appearing at attended locations (see Carrasco, 2006, 2011, for reviews).

Thus, our focus in the present work was to determine whether or not figure–ground assignment increases the perceptibility of the figure, making the figure and targets appearing within it more perceptually salient, or whether the previously found enhancement effects were the result of attentional processes resulting from figure–ground assignment. We developed a task in which observers employed an attentional set for color (à la Folk, Remington, & Johnston, 1992) to direct attention to one region of a multicolor display, and we manipulated figure–ground assignment orthogonally to this attention setting by using image-based convexity cues. More specifically, observers performed a perceptually demanding spatial resolution task, by responding to targets within a specific colored region (red, green) that was equally likely to be the figure (convex) or ground (concave) region (Fig. 1). Observers reported the spatial offset of two vertices (see Baylis & Driver, 1993; Gibson, 1994). By manipulating the size of the offset of the relevant stimuli, we could measure perceptual sensitivity and assess the effects of figure–ground assignment on perceptual-level processing. Since attention should be drawn strongly to the region matching the observers’ attentional set for color (Folk et al., 1992), regardless of that region’s figural status, any benefit in spatial resolution for figures should result primarily from the perceptual enhancement of figures relative to grounds. By the same logic, if no difference in spatial resolution performance were found between figures and grounds, the figural benefits observed in previous studies were likely due to attention and not to perceptual enhancement.

Fig. 1
figure 1

a Examples of the stimuli used in each experiment. Figure position (left, right) and color (red, green) were fully counterbalanced. b Samples of apex offsets. The shifted apex was either higher or lower than the central (fixed) apex

Experiment 1

Method

Participants

Nineteen University of Iowa undergraduates received course credit for volunteering; all reported having normal or corrected-to-normal vision. This experiment complied with all ethical standards, as enforced by the University of Iowa’s Institutional Review Board.

Stimuli

The figure–ground displays, shown in Fig. 1, consisted of two regions and were designed such that one region of the display was clearly figural in nature, on the basis of image-based convexity cues (see Baylis & Driver, 1993). One region was red and the other was green, with each color being equally likely to appear on the left or the right and to define either the figure (convex) or ground (concave) region. The ground region measured 4.83° × 5.05° of visual angle and contained two concave angles (135°) that were separated by 2.5° of visual angle. The ground region shared a contour with a figure (subtending 3.35° × 7.77° of visual angle) containing two convex angles (also 135°) that were also separated by 2.5° of visual angle. The position of the innermost apex that was shared by the two regions was held at a constant location in the center of the display (i.e., at fixation), whereas either of the two outermost apices (either that on the figure or that on the ground) was offset 0.04°, 0.11°, 0.22°, 0.33°, 0.44°, or 0.55° of visual angle above or below this fixed apex (see Fig. 1b). This design allowed us to obtain a measure of observers’ spatial resolution by having them make positional judgments of one of the outermost apices (either that on the green region or that on the red region) relative to the fixed central apex. The stimuli were viewed from a distance of approximately 77 cm, and from this distance each display subtended approximately 7.91° × 7.77° of visual angle.

We presented a total of 48 figure–ground displays: 2 color assignments (figure red, figure green) × 2 positions (figure left, figure right) × 2 offset directions (above fixed apex, below fixed apex) × 6 angle offsets (0.04°, 0.11°, 0.22°, 0.33°, 0.44°, 0.55°).

Procedure

The sequence and timing of events are depicted in Fig. 2. Each trial began with a fixation point, which was visible for 500 ms. The figure–ground display was then presented for 200 ms, too brief a duration for observers to make eye movements. The screen then remained blank until observers had made a response, with a 500-ms intertrial interval preceding the presentation of the next trial.

Fig. 2
figure 2

Sequence and timing of events for a single trial in each experiment

In order to fully counterbalance both color and response, each observer responded in one of four ways during the spatial resolution task. First, observers attended to either the red or green region of the figure–ground display, according to the experimenter’s instructions; half of the observers attended the red region of the display, and the other half attended the green region of the display throughout the experiment. Within each color condition, observers indicated which of the two apices, that on the right of the region or that on the left of the region, was either higher or lower; in other words, half of the observers reported which apex in the attended color region was higher, and half reported which apex was lower. For instance, an observer in the “green–higher” condition would attend the green region of the display and respond which of the apices (that on the left or right of the green region) was higher relative to the other. Observers responded using a button box, with the left index finger on the button for “left” responses and the right index finger on the button for “right” responses. Each observer received a block of 96 practice trials, which were not analyzed, followed by eight blocks of 96 trials.

Results and discussion

The data analyses include only observers with response accuracies significantly above chance (defined as 60%, in this study). This criterion resulted in the exclusion of three participants from the final analysis, leaving data from 16 participants (n = 4 in each response condition) in the following analyses.

We analyzed the accuracy data with a two-factor analysis of variance (ANOVA), with Figural Status (apices on figure or ground) and Apex Offset (0.04°, 0.11°, 0.22°, 0.33°, 0.44°, or 0.55°) as factors. These data appear in Fig. 3. We observed a significant main effect of apex offset, F(5, 75) = 21.6, p < .0001, η p 2 = .59, with larger offsets being discriminated more accurately than smaller offsets. Observers discriminated targets falling in the figural region consistently more accurately than those falling in the ground region (82.5% vs. 75.3%, respectively), as evidenced by the significant main effect of figural status, F(1, 15) = 8.01, p = .01, η p 2 = .35. This main effect was accompanied by a significant two-way interaction, F(5, 75) = 10.9, p < .001, η p 2 = .42, indicating that spatial discriminability across the differing offsets was greater for figures than for grounds. We conducted planned comparisons to examine the figural benefit at each of the apex offsets. Planned comparisons showed that observers discriminated the offset of the apices significantly more accurately when the apices appeared on figures relative to grounds at all but the smallest (0.04°) and largest (0.55°) offsets (all significant ts > 2.4, ps < .02), possibly due to a floor and a ceiling effect, respectively.

Fig. 3
figure 3

Results of Experiment 1. Points represent the mean accuracy data for targets appearing in figure and ground regions, as a function of apex offset. Error bars represent 95% within-subjects confidence intervals

These results suggest that figural status affected performance in our spatial resolution task. Given that we controlled for attention by requiring observers to adopt a consistent attentional set for color, the greater accuracy for spatial resolution judgments in figural than in ground regions appears to be the result of a perceptual enhancement effect. As such, it appears that regions assigned figural status possess an enhanced perceptual representation over regions assigned as grounds.

Although our results are consistent with an enhancement account, it is possible that the improved performance for targets appearing on figures is not the result of figure–ground assignment per se, but is instead due to a physical stimulus confound. Specifically, on trials in which the target appeared within the figure, observers compared points on two convexities, whereas on trials in which the target appeared on the ground, they compared points on two concavities. Thus, in Experiment 2 we separated the two regions of our display, such that they no longer shared a contour and thus appeared as two figures on a larger background. If the increased performance for targets appearing on figures results from an enhanced representation of the figure during figure–ground assignment, we would expect no difference in performance depending on which region of the display the target appeared in. However, if our results were instead due to a confound between figure–ground status and the convexity/concavity of the shape on which the discrimination was performed, we would expect results identical to those observed in Experiment 1.

Experiment 2

Method

Participants

Sixteen University of Iowa undergraduates received course credit for volunteering; all reported having normal or corrected-to-normal vision. This experiment complied with all ethical standards, as enforced by the University of Iowa’s Institutional Review Board.

Stimuli

The stimuli were identical to those used in Experiment 1, with the following exceptions: The two regions were separated by 0.44° (0.22° on either side of fixation), appearing as two figures on a larger background. These separated displays subtended 8.20° × 7.77° of visual angle.

Procedure

The procedure was identical to that of Experiment 1.

Results and discussion

The accuracy data are shown in Fig. 4. These data were analyzed using a two-way ANOVA identical to that used in Experiment 1. As in Experiment 1, we observed a significant main effect of apex offset, F(5, 75) = 38.9, p < .0001, η p 2 = .72, with larger offsets being discriminated more accurately than smaller offsets. However, in contrast to Experiment 1, we did not observe a significant main effect related to which region contained the target, F(1, 15) < 1. This suggests that the results of Experiment 1 were not due to the shapes of the regions themselves, but instead to an enhancement of regions assigned figural status.

Fig. 4
figure 4

Results of Experiment 2. Points represent the mean accuracy data for targets appearing in figure and ground regions, as a function of apex offset. Error bars represent 95% within-subjects confidence intervals

However, another alternative interpretation of our results must also be addressed. Specifically, it is possible that rather than being due to an enhancement of figural regions, the results of Experiments 1 and 2 were driven by a suppression of ground regions (Peterson & Skow, 2008; Salvagio, Cacciamani, & Peterson, 2012). Furthermore, if figure–ground assignment represents a competitive process (Vecera & O’Reilly, 1998, 2000), it may be the case that both figure enhancement and ground suppression contributed to the results above. In order to adjudicate between these possible mechanistic accounts, in Experiment 3 we interspersed the figure–ground displays from Experiment 1 with the figurally “neutral” displays used in Experiment 2, in which the objects no longer shared a contour, and thus should not show competitive influences with one another (Salvagio et al., 2012; Vecera & O’Reilly, 2000).

Thus, if the effects observed in Experiment 1 were due solely to figural enhancement, in Experiment 3 we would expect to find no difference in performance for targets on neutral stimuli or ground regions, but a benefit for targets appearing on figures. Conversely, if the effect was driven by ground suppression, performance would decrease for targets on ground regions relative to equivalent performance for targets on neutral stimuli and figures. A combination of figural enhancement and ground suppression would lead to discrimination accuracy for targets appearing on neutral stimuli falling somewhere in-between those for targets appearing on figures versus grounds.

Experiment 3

Method

Participants

Sixteen University of Iowa undergraduates received course credit for volunteering; all reported having normal or corrected-to-normal vision. This experiment complied with all ethical standards, as enforced by the University of Iowa’s Institutional Review Board.

Stimuli

The stimuli were identical to those used in Experiments 1 and 2, with the following exception: Separated displays (Exp. 2) were interspersed with the figure–ground stimuli (Exp. 1).

Procedure

The procedure was identical to those in Experiments 1 and 2, except for the following modifications. To keep the number of trials identical to that in the previous experiments, we only used three offsets (0.04°, 0.33°, and, 0.55°). In half of the trials, the target appeared within either the “figure” region or the “ground” region of the stimuli used in Experiment 1. In the other half of the trials, the target appeared in one of the two (convex or concave) figurally “neutral” regions (formerly the figure and ground regions, respectively) of the stimuli used in Experiment 2. The sequence of events and timing were identical to those in the other experiments, with a 96-trial practice block (which was not analyzed) being followed by eight testing blocks of 96 trials each.

Results and discussion

The accuracy data are shown in Fig. 5. These data were analyzed using a two-way ANOVA similar to those used in the previous experiments, with the exception that figural status now had three levels (apices on figure, ground, or “neutral” region), and apex offset also had only three levels (0.04°, 0.33°, or 0.55°). As in the previous experiments, we observed a significant main effect of apex offset, F(2, 30) = 156.2, p < .0001, η p 2 = .91, with larger offsets being discriminated more accurately than smaller offsets across all stimulus types. As in Experiment 1, we also observed a significant main effect of figural status, F(2, 30) = 18.4, p < .0001, η p 2 = .55, and an interaction between figural status and offset, F(4, 60) = 3.52, p = .01, η p 2 = .19.

Fig. 5
figure 5

Results of Experiment 3. Points represent the mean accuracy data for targets appearing in convex and concave regions, as a function of apex offset. Error bars represent 95% within-subjects confidence intervals

As can be seen in Fig. 5, we observed a significant benefit to spatial resolution accuracy when the target appeared in the figure region, as opposed to when it appeared in either the ground or neutral regions, at all but the smallest offset, replicating the results of Experiment 1 and providing support for a perceptual-enhancement account of figure-related processing benefits. This was confirmed by planned comparisons that showed a significant benefit in accuracy for targets appearing on figure relative to neutral regions (0.33° offset: t = 2.67, p = .02; 0.55° offset: t = 3.12, p < .01). However, we also observed a significant cost to accuracy when the target appeared in the ground region relative to either the figure or a neutral region, suggesting a concurrent perceptual suppression of ground regions, consistent with previous work (Peterson & Skow, 2008; Salvagio et al., 2012). This was confirmed by planned comparisons that showed a significant decrement in accuracy for targets appearing on ground relative to neutral regions (0.33° offset: t = 3.37, p < .01; 0.55° offset: t = 2.88, p = .01).

Thus, when observers viewed interspersed figure–ground and figurally neutral displays, discrimination accuracy was highest for the targets falling within figural regions, lowest for targets falling within ground regions, and intermediate for targets falling in neutral regions. This provides direct evidence that both enhancement of figures and the suppression of grounds are likely responsible for figure-related benefits in visual processing.

General discussion

Our results demonstrate that when targets appear within regions assigned figural status, they are discriminated consistently more accurately than when they appear within ground regions. Additionally, when figure–ground and figurally neutral displays were both presented (Exp. 3), targets appearing on figures were discriminated more accurately than those on either neutral or ground regions, whereas targets appearing on grounds were discriminated less accurately than those on either neutral or figure regions. We hypothesize that the increased discriminability of targets in figural regions may arise from an enhanced neural representation of figures, whereas the decreased discriminability of targets appearing on ground regions may arise from a suppressed neural representation of targets appearing within these regions. Such a view is consistent with previous primate neurophysiological studies (Lamme, 1995; Marcus & Van Essen, 2002; Qiu et al., 2007) and with models of figure–ground assignment (e.g., Craft et al., 2007; Roelfsema et al., 2002; Vecera 2000; Vecera & O’Reilly, 1998, 2000).

Given the close relationship between figure–ground assignment and attention, the sensory-gain control accounts used to explain attentional effects (e.g., Hawkins et al., 1990; Hillyard, Vogel, & Luck, 1998) can be readily adapted to explain the increased saliency of figures following figure–ground assignment. It is possible that the phenomenological advantage of figures described by Rubin (1915/1958) and others may result from the enhancement of regions assigned as figure and the suppression of regions assigned as ground; in other words, the sensory gain for figures is “turned up,” whereas the gain for grounds is “turned down.” However, our results suggest that this gain control is independent of attention, because in our task observers’ attention was directed to each region on the basis of its color and not of its status as a figure or a ground. Because the region containing the target was manipulated orthogonally to figural status, and because the attentional set for the color of the target-containing region should have strongly biased attention toward that region (cf. Folk et al., 1992), the differences we observed in discrimination accuracy for figures and grounds cannot be attributed solely to attention-related enhancement effects. This also suggests that figure–ground assignment can proceed independently of the focus of attention, consistent with reports that figure–ground assignment can occur preattentively and without attention resources (Driver, Baylis & Rafal, 1992; Kimchi & Peterson, 2008).

Consistent with this interpretation, a number of neurophysiological studies in nonhuman primates have shown that neuronal firing rates in response to figural regions are increased relative to firing rates in response to ground regions, even when those regions are outside the focus of attention. For instance, Lamme (1995) examined the responses of individual striate cortex (V1) neurons in macaque monkeys to figures versus grounds, as defined by the orientations of line segments in an array. When the segment that formed the figure region was located within a neuron’s receptive field, he observed a 40% increase in firing rates as compared to when the receptive fields were located on the ground region. Similarly, Marcus and Van Essen (2002) showed an enhancement of responses in macaque V1 and V2 neurons when the figure fell within a cell’s receptive field relative to when the ground fell within the receptive field. Importantly, Marcus and Van Essen manipulated the location of attention within the visual display. Responses to figure and ground regions were similar regardless of the location of attention, suggesting that perceptual enhancement and suppression effects were independent of those associated with attentional processes.

Finally, in a study by Qiu et al. (2007) monkeys were presented with overlapping figure–ground displays, designed such that one region of the display (the “figure”) occluded the other region (the “ground”). They showed that the firing rates of V2 neurons whose receptive fields were located on the border between the two regions were increased following the assignment of border ownership to the occluding figure (a process analogous to figure–ground assignment), regardless of whether attention was directed to that region or the occluded ground region. Critically, Qiu et al. also demonstrated that when attention was directed to the occluding “figure” region there was further enhancement of neuronal responses in the V2 neurons of interest. The authors interpreted this result as providing evidence for an interactive account of object-based attentional selection, in which the V2 neurons involved in figure–ground segregation also provide an interface for top-down attentional selection processes. This supports the assertion that figure–ground assignment produces enhancement effects that precede, but may interact with, attentional enhancement processes.

Additionally, the perceptual enhancement following figure–ground assignment characterized in the present experiments and in Qiu et al.’s (2007) study may instead reflect an enhancement of well-defined shapes. In these studies, the ground was likely to be perceived as “shapeless,” continuing behind the figure. In other words, the figure–ground displays contained a shared contour that was associated with the figure, resulting in a well-defined figure and an ambiguously shaped ground. Our experiments cannot determine whether or not a shared contour is required in order to observe a perceptual enhancement of figures. Instead, this effect might reflect a processing advantage for completed shapes and not for “shapeless” regions. Future experiments could modify the stimuli such that the figure–ground displays contained well-defined figures and grounds. If perceptual enhancement were found, it could be concluded that enhancement extends to all figures, and not simply to those juxtaposed with a “shapeless” ground.

As we noted in the introduction, Nelson and Palmer (2007) demonstrated both detection and discrimination benefits for targets appearing in figural regions over those appearing in ground regions, a result they interpreted as being due to attention-related enhancement processes. Specifically, they discussed their results as being due to a preferential allocation of attention to figures following figure–ground assignment, which led to attention-related processing benefits for targets falling within the figure. However, the results of our experiments, coupled with the neurophysiological studies outlined above, suggest that regions assigned figural status need not draw attention in order to receive processing benefits. This points toward the possibility that the results of Nelson and Palmer may be due to an interaction between enhancements due to figure–ground assignment and due to attention. A similar argument could be made for the present experiments, which cannot definitively rule out the possibility of an interaction between each of these enhancement effects. For example, figure–ground assignment might occur quickly enough for the figure to draw attention prior to shifting to the task-relevant colored region. However, the neurophysiological effects showing clear evidence of perceptual enhancement even in the absence of attention, coupled with the strong manipulation of feature-based attention used in the present work, suggest a role for perceptual enhancement in driving at least a portion of our observed figural processing benefits. Future experiments should explore the timescale over which each of these enhancement effects operates.

The notion of a bidirectional relationship between figure–ground assignment and attention is consistent with the results of Vecera et al. (2004), who found that when two regions of a figure–ground display are ambiguous with respect to figural status, directing exogenous attention to one of the regions influences which item is perceived as figure. They also showed that even when strong image-based cues to figure–ground assignment were present, the location of attention still affected figure–ground assignment (see also Cosman & Vecera, 2010). This interaction of figure–ground assignment and attention fits well with theories and computational models of figure–ground assignment (e.g., Roelfsema et al., 2002; Vecera, 2000; Vecera & O’Reilly, 1998). For example, in Vecera’s (2000) “biased-competition” model of figure–ground assignment, the perceptual representations of figures are more “active” than those of grounds, causing figures to compete for visual processing resources more effectively. This increased activity can result either from image-based cues to figure–ground assignment, such as the convexity cues used in our study, or from top-down influences of object recognition or attention processes. The results of the present study suggest that following figure–ground assignment, an early enhancement of figures can serve to bias the allocation of visual resources such as attention or memory to particular items in a scene.

Taken together, our results suggest that objects assigned figural status have an enhanced perceptual representation over ground regions, underscoring the complex relationship between figure–ground assignment and attention by showing that these processes interact closely to affect the perception of items in a scene. In the future, it will be important to better understand the nature of this relationship and the constraints that this interaction puts on the later perceptual processing of objects.