Stimulus range and discontinuity effects on information-integration category learning and generalization

Maddox, W. Todd; Filoteo, J. Vincent

doi:10.3758/s13414-011-0101-2

Stimulus range and discontinuity effects on information-integration category learning and generalization

Published: 15 February 2011

Volume 73, pages 1279–1295, (2011)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Stimulus range and discontinuity effects on information-integration category learning and generalization

Download PDF

W. Todd Maddox¹ &
J. Vincent Filoteo²

1127 Accesses
9 Citations
1 Altmetric
Explore all metrics

Abstract

Two studies are reported that support the hypothesis that categories that require a multiple-unit, as opposed to a single-unit, representation lead to worse initial acquisition but better generalization. On the basis of the constraints imposed by the procedural-based learning system thought to mediate information-integration categorization, we argue that the need to train multiple units during initial category acquisition slows the procedural-based category learning process and adversely affects learning performance. However, we speculate that better generalization occurs because of the increased likelihood that a novel stimulus will activate at least one of the multiple units needed to represent the category. Relations to other findings in the literature and the implications of this work for training and clinical assessment are discussed.

Deconstructing the effect of self-directed study on episodic memory

Article 19 June 2014

Cognitive Architecture and Instructional Design: 20 Years Later

Article Open access 22 January 2019

Cognitive load theory and educational technology

Article 01 August 2019

Understanding the experimental factors and psychological processes that facilitate acquisition and generalization has been the focus of psychological research since its inception. This focus has implications for education and training at all levels, from formal classroom training (e.g., reading, writing, and arithmetic), to skill learning (e.g., driving, airline screening, medical diagnosis, or radiology), to rehabilitation and interventions (e.g., improving memory and attention or reducing drug relapse).

One cognitive skill for which acquisition and generalization processes are critical is information-integration perceptual classification (Ashby, Alfonso-Reese, Turken, & Waldron, 1998; Ashby & Maddox, in press; Maddox & Ashby, 2004). These are classification problems for which there is no verbal analogue to the optimal rule and for which learning is gradual and incremental. Some of the most important occupations in our society—such as airport screening, radiology, or medical diagnosis, to name a few—involve classification, and we spend an enormous amount of time and money devoted to training individuals to perform these jobs. In a typical classification learning task, participants are presented with a stimulus and are asked to classify it into a single category. Once a response is made, participants are provided with corrective feedback. A common assumption in classification research (as well as other research areas) is that training regimens that lead to good initial acquisition when feedback is presented will also lead to good generalization to novel items within and outside the range of training items, even when corrective feedback is removed (for a review, see Schmidt & Bjork, 1992; Smith, Redford, Washburn, & Taglialatela, 2005).

Early learning theorists, however, recognized that this assumption is often false, and they noted that experimental factors that improve initial acquisition can either lead to good or poor generalization once feedback is removed (Estes, 1955; Hull, 1943; Skinner, 1938; Tolman, 1932). Perhaps counterintuitively, there is evidence in some domains that experimental factors that lead to worse initial acquisition actually lead to better generalization (Schmidt & Bjork, 1992). This pattern has been observed in the motor and verbal learning domains (Balota, Duchek, & Logan, 2007; Bjork, 1994; Bjork & Linn, 2006; Karpicke & Roediger, 2007, 2008; Landauer & Bjork, 1978; Roediger & Karpicke, 2006). For example, in motor learning, spaced practice of motor movements leads to worse initial acquisition but better generalization, whereas massed practice of the same movements leads to better initial acquisition but worse generalization (see, e.g., Shea & Morgan, 1979).

Given the important role that classification plays in many real-world skills, and given the fact that good acquisition training does not necessarily imply good generalization, it is critical to evaluate the efficacy of any training procedure by incorporating a transfer phase that includes novel items from within and outside the range of training items and during which feedback is not provided (Schmidt & Bjork, 1992). Although acquisition training usually involves presentation of a fixed set of items, the true test of generalization lies in one’s ability to classify not only items that are similar to the training items (i.e., those from within the range of trained items), but also items that are dissimilar from them (i.e., items from outside that range) (Erickson & Kruschke, 1998, 2002; Smith et al., 2005). We take this approach in the present study.

In addition, we use a model-based approach to determine how different training regimens affect the types of processes people use to perform that task and how these affect the nature of generalization. To anticipate, the model-based analysis turns out to be critical in the interpretation of the data.

Overview of the present study

The overriding aim of the present work is to examine the effects of category range and category discontinuity on both acquisition of and generalization from a broadly sampled set of stimuli. Category range is defined as the breadth of stimulus values along the stimulus dimensions (often referred to as category variance in the literature; Cohen, Nosofsky, & Zaki, 2001; Hahn, Bailey, & Elvin, 2005; Rips & Collins, 1993). Category discontinuity results when each category is composed of distinct subclusters of stimuli that are separated by unsampled portions of the stimulus space.

Category range and discontinuity effects have been examined in the literature, but often the two factors are confounded, making it difficult to determine their independent impacts. For example, Maddox, Filoteo, and Lauritzen (2007; for related work, see Kornell & Bjork, 2008; Maddox, Filoteo, Lauritzen, Connally, & Hejl, 2005) examined the effects of continuous versus discontinuous category training on information-integration acquisition and generalization. Scatterplots of the exemplars from the (small-range) continuous and discontinuous training conditions for the information-integration categories are displayed in Fig. 1 (along with the transfer items). Maddox et al. (2007) found that, for information-integration categories, acquisition was adversely affected by discontinuous category training, but that no-feedback transfer performance was better in the discontinuous training condition than in the continuous training condition. Unfortunately, discontinuity was confounded with range, making it impossible to determine whether the increased range or the category discontinuity led to the observed acquisition and generalization performance difference.

Real-world categories differ in their ranges and levels of continuity. For example, members of the category “hand guns” (which an airline screener must learn) are highly similar, and thus have a relatively small category range and are fairly cohesive (i.e., highly continuous). On the other hand, the category “weapon” is highly variable and contains items such as knives, bombs, guns, and so on, which are highly discontinuous. The differences in continuity and range between these two categories could have implications for acquisition and generalization under various training conditions. Thus, it is important to disentangle category range and discontinuity to understand the effects these factors would independently have on acquisition and generalization in the real world.

The present study provides an unconfounded test of the effects of category range and category discontinuity on information-integration acquisition and generalization, across two experiments. Each experiment includes a small-range continuous, a large-range continuous, and a discontinuous acquisition training condition. Comparison of performance across the small- and large-range continuous conditions provides a test of the effects of increased category range on acquisition and generalization while holding discontinuity constant, whereas comparison of performance across the large-range continuous and discontinuous conditions provides a test of the effects of category discontinuity on acquisition and generalization while holding category range constant.

A number of factors that are often left uncontrolled are held constant across our experimental conditions. These include the number of acquisition training trials, the nature of the optimal decision bound, and optimal accuracy. A no-feedback transfer phase is also included that tests performance for items from within the trained portion of the stimulus space and generalization to items outside the trained portion of the stimulus space.

As the results (presented below) suggest, one of these experiments supports the hypothesis that category discontinuity, and not category range, leads to poor initial acquisition but better generalization, whereas the other supports the hypothesis that category range, and not discontinuity, leads to poor initial acquisition but better generalization. Importantly, were we to focus our interpretation only on these empirical data, we would be left with a contradictory set of findings. However, by applying computational models, we offer a unified explanation of these findings that is consistent with the known processing characteristics of the procedural-based learning system that is thought to mediate information-integration classification acquisition (Ashby et al., 1998; Ashby & Ennis, 2006; Cincotta & Seger, 2007; Nomura et al., 2007). To do so, we apply a procedural-based learning model, called the striatal pattern classifier (SPC; Ashby & Waldron, 1999), to the data. To anticipate, the model-based analyses suggest that neither increased category range nor category discontinuity accounts for the results. Rather, the more direct mediator of performance appears to be whether a single-unit or multiple-units representation best represents each category. We now briefly outline the model-based approach.

Model-based approach

The goal of the model-based approach is two-fold. First, we use the models to determine whether and when participants are using the task-appropriate process—that is, a process consistent with the known characteristics of the procedural-based learning system—or an alternative process. We focus our analyses on those individuals who used the appropriate process, since they are the ones who will be most telling in regard to the questions we ask. Second, we use the models to determine when participants use a multiple-unit representation and whether these situations correspond to cases in which a multiple-unit representation is predicted. Although this approach will be important for future research, we have not used this report to further develop the SPC as a formal model of procedural-based classification learning.

The model-based approach involves applying three models separately to the data from each participant (the details are provided in the Appendix). The first of these is the SPC, which is a computational model whose processing is consistent with what is known about the neurobiology of the procedural-based category learning system thought to underlie information-integration classification performance (Ashby et al., 1998; Ashby & Ennis, 2006; Ashby & Waldron, 1999; Nomura et al., 2007; Seger & Cincotta, 2005). The second is rule-based and instantiates hypothesis-testing strategies such as the application of unidimensional or conjunctive rules. These are verbalizable strategies that are suboptimal in the present studies, but are often utilized by participants. The third is a random-responder model that assumes that the participant guesses on each trial. The model parameters are estimated using maximum likelihood procedures (Ashby, 1992; Wickens, 1982). When the models are nested, G ² (likelihood ratio) tests will be applied below in order to determine the best model. When models are not nested, the goodness-of-fit statistic will be the Akaike information crieterion:

$$ {\hbox{AIC}} = {2}r--{\hbox{2ln}}L, $$

where r is the number of free parameters and L is the likelihood of the model given the data (Akaike, 1974; Takane & Shibayama, 1992). The AIC statistic penalizes a model for extra free parameters in such a way that, the smaller the AIC, the closer a model is to the “true model,” regardless of the number of free parameters (for a discussion of the complexities of model comparisons, see Myung, 2000; Pitt, Myung, & Zhang, 2002).

Because the focus of this research is on information-integration learning and generalization, we here describe the SPC in more detail. The SPC assumes that stimuli are represented perceptually in higher-level visual areas, such as the inferotemporal cortex. Because of the massive many-to-one (approximately 10,000-to-1) convergence of afferents from the cortex to the striatum (Ashby & Ennis, 2006; Wilson, 1995), a low-resolution map of perceptual space is represented among the striatal units. During acquisition training, the striatal units become associated with one of the category labels so that, after acquisition training is complete, a category response label is associated with each of a number of different regions of perceptual space. In effect, the striatum learns to associate a response with clumps of cells in the visual cortex. When all of the stimuli coming from a category are perceptually similar and form a coherent (or continuous) group, the category can be represented by a small number of units. However, when the stimuli coming from a category are perceptually dissimilar and form a less coherent, or even discontinuous, group, a multiple-unit representation will be needed.

It is important to be clear that the SPC is a computational model inspired by what is known about the neurobiology of the striatum. Because of this fact, the striatal “units” are hypothetical and could be interpreted within the language of other computational models (e.g., as “prototypes” in a multiple-prototype model like SUSTAIN; Love, Medin, & Gureckis, 2004). In addition, we do not model learning with the SPC, in the sense that we do not update association weights between units and category labels. Learning models have been proposed (e.g., Ashby, Paul, & Maddox, 2011), but because our focus is on asymptotic acquisition and generalization (see below), computational versions of the model are adequate to capture behavior at the end of learning.

Acquisition and generalization predictions

If we make the reasonable assumption that category learning is computationally and biologically more efficient when each category can be represented by fewer rather than more units, it then follows that the procedural-based learning system should be more efficient when stimuli coming from the same category form a coherent (or continuous) group, because fewer units might then be required. Alternatively, it should be less efficient when stimuli coming from the same category are perceptually dissimilar and form a less coherent (or discontinuous) group, because more units might be required to represent the categories. In line with this prediction, previous research has suggested that initial acquisition is slower when more units are needed to represent each category, such as when the decision bound is nonlinear (Ashby & Maddox, 1990, 1992; Ashby & Waldron, 1999; Maddox et al., 2007). In fact, Maddox et al. (2007) found that more units were needed to account for final-block acquisition in their discontinuous condition than in their small-range continuous condition.

Although initial acquisition might be slower when a multiple-unit representation is required, it is reasonable to hypothesize that a more distributed multiple-unit representation might lead to better generalization. This follows because novel stimuli presented during generalization would be more likely to activate one of the many units associated with a multiple-unit representation and would be less likely to activate the one unit associated with a single-unit representation. Importantly, this prediction holds in cluster models such as SUSTAIN (Love et al., 2004). Taken together, this reasoning implies that the more striatal cells are involved in representing a category, the more difficult category acquisition will be, but the more likely it will be that a novel stimulus (e.g., stimuli presented during transfer) will be associated with that category. This leads to two predictions. First, multiple-unit SPC models should provide better model fits to data collected in generalization conditions for which a multiple-unit representation was needed during acquisition. Second, generalization accuracy rates should be higher in these conditions, and should be especially high when a multiple-unit SPC model provides the best account of the data.

We turn now to the experiments. For each experiment, we first generate performance predictions based on the category structure and the models. We then use the models to help organize the results by breaking participants into groups based upon the best-fitting model: SPC, rule-based, or random-responder.

Experiment 1

Experiment 1 used stimuli that were lines varying in length and orientation across trials. Scatterplots of the training stimuli in the small-range continuous, large-range continuous, and discontinuous conditions are displayed in Fig. 1, along with a scatterplot of the transfer stimuli. The transfer block included test items from within and outside the trained portion of the stimulus space. Each point in Fig. 1 denotes a unique stimulus, with each symbol denoting stimuli from different categories.

Maddox et al. (2007) found that the small-range continuous condition required a single-unit representation for each category, whereas the discontinuous condition require a multiple-unit (specifically, four-unit) representation. They reasoned that because the stimuli in each of the small-range continuous categories were tightly packed around a single, central prototype, a single-unit representation would suffice. They also reasoned that because the stimuli in each of the discontinuous categories were from discontinuous clusters of stimuli, a multiple-unit representation would be needed. In the large-range continuous condition of Experiment 1, the stimuli were not tightly packed around a central prototype, but they were continuously sampled and, more importantly, spread evenly around a central prototype. This made a single-unit representation (or, at the very least, one strong central unit surrounded by other weaker units) likely in this condition. Thus, for Experiment 1, a single-unit representation was predicted in the two continuous conditions, and a multiple-unit representation was predicted in the discontinuous condition. This implies that initial acquisition should be superior in the two continuous conditions but that generalization should be superior in the discontinuous condition.

Method

Participants

A total of 90 participants (30 per condition) completed the study and received course credit for their participation. All participants had normal or corrected-to-normal vision. Each participant served in only one condition, and all met a learning criterion of 55% in the final acquisition training block.

Stimuli and stimulus generation

The stimuli are displayed in Fig. 1, along with the optimal decision bounds. The category distribution parameters are outlined in Table 1, and optimal accuracy was 95%. In the small-range continuous condition and the discontinuous condition, each category was composed of four “subclusters” (16 total), with 30 stimuli being sampled randomly from each, for a total of 480 stimuli. In the large-range continuous condition, each category was composed of nine “subclusters” (36 total), with 13 stimuli being sampled randomly from each and 3 additional stimuli being sampled randomly from each category, for a total of 480 stimuli. The random samples were linearly transformed so that the sample mean vector and sample variance–covariance matrix equaled the population mean vector and variance–covariance matrix for each subcluster. Each random sample (x, y) was converted to a stimulus by deriving the length (in pixels) as l = x and the orientation (in degrees counterclockwise from horizontal) as o = 18y/50. These scaling factors were chosen to roughly equate the salience of each dimension. The resulting 480 stimuli were randomized and divided into five 96-trial blocks separately for each participant. These were presented during category acquisition training. A total of 144 stimuli (36 from each of the four response regions) were used during the no-feedback transfer phase (see Fig. 1).

Table 1 Category distribution parameters from Experiment 1

Full size table

Procedure

Each participant was run individually in a dimly lit testing room with an approximate viewing distance of 35 cm. The participants were informed that there were four equally likely categories. They were also informed that perfect performance was impossible, but that high levels of accuracy could be achieved. They were instructed to learn about the categories, to be as accurate as possible, and not to worry about their speed of responding. On each trial, the stimulus appeared and remained on the screen until the participant generated a response by pressing one of two keys. The correct category label was then presented on the screen for 1 s, along with the word “wrong,” if the response was incorrect, or “right,” if the response was correct. Once feedback was given, the next trial was initiated. The procedure for the transfer trials was identical, except that feedback was omitted.

Results

The Results section is organized as follows. First, we apply the models to the final acquisition block and to the transfer block to determine whether each participant was using a procedural-based, rule-based, or random process. Because of concerns with modeling aggregate data, each participant’s data were fit separately (e.g., Ashby, Maddox, & Lee, 1994; Estes, 1956; Maddox, 1999; Maddox & Ashby, 1998; Smith & Minda, 1998). Four versions of the striatal pattern classifier (SPC) were fit to the data (SPC-1, SPC-2, SPC-4, and the optimal model). The SPC-1 assumed one unit per category, the SPC-2 assumed two units per category, and the SPC-4 assumed four units per category. Models with more units were not examined since, at most, a category contained four clusters of stimuli. The optimal model assumed that the optimal decision bounds were applied. Each of the SPC models assumed that, on each trial, the participant determined which unit was closest to the perceptual effect and gave the associated response—with the only difference among the models being the difference in the number of units. If one of these four models provided the best account of the data, the participant was classified as an “SPC user.” A number of conjunctive and unidimensional hypothesis-testing models, as well as the random-responder model, were also applied to the data (see the Appendix). If one of the hypothesis-testing models provided the best account of the data, the participant was classified as a “rule-based user.” If the random-responder model provided the best account of the data, the participant was classified as a “random responder.”

Second, because our main focus is on participants who used procedural-based learning strategies during the final acquisition block and during the transfer block, we display the learning curves and transfer performance for participants who were classified as SPC users during both the final acquisition block and the transfer block. For these same participants, we also examine transfer performance in greater detail by examining transfer performance for items sampled from within the trained portion of the space separately from items sampled from outside the trained portion of the space. For completeness, we also display the learning curves and transfer performance for those participants who did not use a procedural-based learning strategy in the final acquisition block or the transfer block. Finally, we examine the nature of strategy shifts across the final acquisition and transfer blocks and the performance under various strategy-shift conditions. The focus of this analysis was to compare and contrast performance for single- versus multiple-unit SPC users.

Learning curves and transfer performance for participants best fit by the SPC in the final acquisition and transfer blocks

Figure 2a displays the average proportions correct for the small-range continuous, large-range continuous, and discontinuous conditions for each of the five acquisition training blocks and for the transfer block, only for those participants who were SPC users (i.e., those best fit by the SPC-1, SPC-2, SPC-4, or optimal model) in the final acquisition block and the transfer block. To reiterate, it was important to focus on these individuals because they are using the task-appropriate process. This analysis included 57%, 73%, and 93% of the participants from the small-range continuous, large-range continuous, and discontinuous conditions, respectively. A 3 Condition (small-range continuous vs. large-range continuous vs. discontinuous) × 5 Acquisition Block ANOVA was conducted. There was a significant effect of condition [F(2, 64) = 16.99, p < .001, η² = .347] that suggested worse acquisition in the discontinuous condition relative to the two continuous conditions (ps < .001 for both comparisons), with the latter two conditions showing no significant performance differences. There was a significant effect of block [F(4, 256) = 29.29, p < .001, η² = .314] suggesting that learning occurred, and the interaction was nonsignificant (F < 1). Thus, category range had no effect on initial acquisition, as suggested by a comparison of performance in the small- and large-range continuous conditions, whereas category discontinuity had a large attenuating effect on initial acquisition, as suggested by a comparison of performance in the large-range continuous and discontinuous conditions.

We also examined the change in performance from the final acquisition block to the transfer block. The performance drop was nonsignificant in the small-range continuous condition but was significant in the large-range continuous condition (p < .01). In the discontinuous condition, there was a significant performance increase (p < .01).

Figure 2b displays the transfer performance for items from within and outside the trained region (along with performance in the final acquisition block) for the participants displayed in Fig. 2a. The effect of condition was significant for the transfer items from within the trained region of the space [F(2, 64) = 6.39, p < .01, η² = .165] and was characterized by significantly worse performance in the small-range continuous condition (.70) than in the large-range continuous (.78) and discontinuous (.79) conditions (both ps < .01), with no performance difference emerging for the latter two conditions (n.s.). The effect of condition was nearly significant for the transfer items from outside the trained region of the space [F(2, 64) = 2.81, p = .068, η² = .081] and was characterized by a significant performance difference between the small-range continuous (.72) and discontinuous (.77) conditions (p < .05).

For completeness, Fig. 2c displays the learning curves and transfer block performance for the remaining 43%, 27%, and 7% of the participants from the small-range continuous, large-range continuous, and discontinuous conditions, respectively. These are the individuals whose final acquisition block or transfer block data were best fit by a rule-based model or by the random-responder model. Given the small sample size, ANOVAs were not conducted.

Taken together, these data suggest that for SPC users (those best fit by one of the SPC models in the final acquisition and transfer blocks), acquisition is worse but transfer is better in the discontinuous condition than in the two continuous conditions. This supports our initial claim that discontinuous categories should be more difficult to acquire but should lead to better transfer. What these data do not tell is whether this performance pattern is due to the increased use of multiple-unit representations in the discontinuous condition. To answer this important question, we turn now to a more detailed analysis that examines performance separately for single-unit SPC users and multiple-unit SPC users. As outlined in the introduction, we predict that a multiple-unit representation will be more likely to provide a better account of the data in the discontinuous than in the two continuous conditions.

Single- versus multiple-unit SPC analyses

The percentage of participants in each condition whose final acquisition block and transfer block of data were best fit by specific model pairings, along with the proportions correct achieved by those participants, is presented in Table 2. Five model pairings are examined. First, we examine performance for participants whose final acquisition block of data was best fit by the single-unit SPC (sSPC). These participants were divided into those whose transfer block was also fit by an SPC model (SPC-1, SPC-2, SPC-4, or optimal; henceforth, sSPC–SPC) or by some other model (i.e., one of the hypothesis-testing models or the random-responding model; SPC–other). Next, we examine performance for participants whose final acquisition block of data were best fit by a multiple-unit SPC (mSPC; SPC-2, SPC-4, or optimal model in transfer block). These participants were divided into those whose transfer block was also fit by an SPC (mSPC–SPC) or by some other model (mSPC–other). Finally, we examined performance for all participants whose final acquisition block of data was best fit by a rule-based model or the random-responder model.

Table 2 Model results from Experiments 1 and 2 for the final acquisition block and the no-feedback transfer block (see the text for details)

Full size table

Several comments are in order. First, the proportion of mSPC–SPC participants was largest in the discontinuous condition (47%) and was smaller in the small-range (13%) and large-range (20%) continuous conditions. This supports our hypothesis that discontinuous training, relative to continuous training, is more likely to lead to a multiple-unit representation. Second, final-acquisition-block accuracy was higher for mSPC–SPC participants than for any other participant group, and this was especially salient in the discontinuous condition. Finally, transfer block accuracy was higher for mSPC–SPC participants than for any other participant group, and this also was especially salient in the discontinuous condition.

Discussion

These data suggest that discontinuous category training leads to worse initial acquisition but better performance from the final acquisition block to the transfer block, whereas continuous-category training (even when the range of values is equated with that from the discontinuous categories) leads to better initial acquisition but a decrease in performance from the final acquisition block to the transfer block. The model-based analyses suggest that discontinuous, but not continuous, categories are more likely to lead to a multiple-unit representation, and that this might explain the finding of worse initial acquisition but better transfer in the discontinuous conditions. In Experiment 2, we tested this hypothesis against the alternative hypothesis that category discontinuity alone drives the effect. To achieve this aim, we compared discontinuous-condition performance against performance in a continuous condition for which a multiple-unit representation was likely.

Experiment 2

Experiment 1 examined category range and discontinuity effects in a four-category information-integration task and found support for the hypothesis that discontinuous-category training leads to worse initial acquisition but better generalization when category range is held constant. To our knowledge, this is the first study to rigorously test these two hypotheses.

Notice from Fig. 1 that the exemplars from each category in the discontinuous condition are sampled from four subclusters of stimuli. Many computational models of categorization would predict that under these conditions some sort of multiple-unit (e.g., SPC; Ashby & Waldron, 1999), multiple-prototype (e.g., rational model; Anderson, 1991), or multiple-cluster (e.g., SUSTAIN; Love et al., 2004) representation would be required for learning. In other words, each of these subclusters of stimuli would be represented by a separate unit, prototype, or cluster, with each of these being assigned to a specific category. When presented with a new item, the distance between the item and each unit would be calculated and the item assigned to the category associated with the nearest unit.^{Footnote 1} Notice also that the exemplars from each category in the small-range and large-range continuous condition are sampled continuously from the space and are spread evenly around a central prototype. This makes a single-unit representation (or at the very least one strong central unit surrounded by other weaker units) likely in this condition. These observations, along with the finding that acquisition was worse but generalization was better in the discontinuous condition, support the hypothesis that the need for a multiple-unit representation impedes initial learning but results in better generalization.

In Experiment 2, we tested the generalizability of these findings to a two-category case (Maddox et al., 2005) for which multiple units would likely be required to represent the categories, even though no discontinuity existed. Scatterplots of the training stimuli in the small-range continuous, large-range continuous, and discontinuous conditions are displayed in Fig. 3, along with a scatterplot of the transfer stimuli. The transfer block included novel items from within and outside the range of training items, to evaluate generalization. First, notice that each category of training items in the discontinuous condition is composed of two distinct and dissimilar clusters of items. Thus, a two-unit representation per category would likely be required. Next, notice that each category of training items in the large-range condition, although not composed of discontinuous clusters, is composed of items that are “spread” out parallel along the decision bound. Using prototype terminology, there is no single prototype or central tendency that adequately describes the training stimuli. In this case, it is likely that a multiple-unit representation would also be required. Finally, notice that each category of training items in the small-range condition is tightly packed, likely yielding a single-unit representation. If the large-range continuous condition does, in fact, require a multiple-unit representation, then the performance pattern in that condition should be similar to that observed in the discontinuous condition, which should yield poorer initial acquisition but better generalization relative to the small-range continuous condition.