Post-training flexibility in category learning

Yang, Lee-Xieng; Chiang, Po-An

doi:10.3758/s13423-023-02451-7

Post-training flexibility in category learning

Brief Report
Published: 05 February 2024

(2024)
Cite this article

Psychonomic Bulletin & Review Aims and scope Submit manuscript

Lee-Xieng Yang^1,2 &
Po-An Chiang¹

129 Accesses
Explore all metrics

Abstract

Exemplar models of categorization, which assume that people make classification decisions based on item information stored in memory, typically assume that all of the exemplars are available and inform decision-making. However, in this study, we hypothesized that people may selectively emphasize subsets of exemplars, giving rise to individual differences in categorization. To verify this hypothesis, we adopted the partial-XOR category structure in Conaway and Kurtz (Psychonomic Bulletin & Review, 24, 1312–1323 2017), which has been evident to be able to induce two major response patterns in the transfer phase: the Proximity and XOR patterns. “Experiment 1” confirmed that these two patterns could be generated if participants were trained with only the exemplars of one category or the other. In “Experiment 2”, participants were asked to not only learn the category labels of all exemplars but also memorize the exemplars of only Category A (Condition A), only Category B (Condition B), or two categories (Condition AB) for a recognition test after the training phase of the categorization task. As expected, in the transfer phase, the participants tended to perform the XOR and Proximity patterns, when the exemplars of Category A and Category B were respectively targeted for the recognition test. The parameters of the SDGCM estimated by Bayesian inference for modeling the data of “Experiment 2” showed that the exemplar accessibility of Category A was larger than that of Category B for performing the XOR pattern and vice versa for performing the proximity pattern, hence verifying our hypothesis.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Generalization in Distant Regions of a Rule-Described Category Space: a Mixed Exemplar and Logical-Rule-Based Account

Article 16 August 2022

Does practice in category learning increase rule use or exemplar use—or both?

Article 08 January 2018

Individual differences in category learning: Memorization versus rule abstraction

Article 15 October 2014

Open Practices Statement

The data and codes are available at https://osf.io/t9e6k/?view_only=f729f30209694007a061d69cc9eb74fa.

Notes

The similarity between two items is the negative exponential function of the distance between them.
We have tried a batch design in which participants underwent all training blocks and then all review blocks in another experiment. The results remained the same.
Exemplars within a category share a same accessibility.

References

Ashby, F. G., & Gott, R. E. (1988). Decision rules in perception and categorization of multidimensional stimuli. Journal of Experimental Psychological: Learning, Memory, and Cognition, 14, 33– 53.
Ashby, F. G., & Maddox, W. T. (1993). Relations between prototype, exemplar, and decision bound models of categorization. Journal of Mathematical Psychology, 37, 372–400.
Article Google Scholar
Austerweil, J. L., Liew, S., Conaway, N. B., & Kurtz, K. J. (2019). Creating something different: Similarity, contrast, and representativeness in categorization
Bowman, C. R., & Zeithamova, D. (2022). Training set coherence and set size effects on concept generalization and recognition. Journal of Experimental Psychology: Learning, Memory, and Cognition, 46(8), 1442–1464.
Google Scholar
Cohen, A., Nosofsky, R., & Zaki, S. (2001). Category variability, exemplar similarity, and perceptual classification. Memory & Cognition, 29, 1165–1175.
Article CAS Google Scholar
Conaway, N., & Kurtz, K. J. (2017). Similar to the category, but not the exemplars: A study of generation. Psychonomic Bulletin & Review, 24, 1312–1323. https://doi.org/10.3758/s13423-016-1208-1
Article Google Scholar
De Schryver, M., Vandist, K., & Rosseel, Y. (2009). How many exemplars are used? Explorations with the Rex Leopold I model. Psychonomic Bulletin & Review, 16(2), 337–343.
Article Google Scholar
Denton, S. E., Kruschke, J. K., & Erickson, M. (2008). Rule-based extrapolation: A continuing challenge for exemplar models. Psychonomic Bulletin & Review, 15(4), 780–786.
Article Google Scholar
Erickson, M. A. (2008). Executive attention and task switching in category learing: Evidence for stimulus-dependent representation. Memory and Cognition, 36, 749–761.
Article PubMed Google Scholar
Erickson, M. A., & Kruschke, J. K. (1998). Rules and exemplars in category learning. Journal of Experimental Psychological: General, 127, 107–140.
Article CAS Google Scholar
Hsu, A. S., & Griffiths, T. E. (2010). Effects of generative and discriminative learning on use of category variability. In S. Ohlsson & R. B. Catrambone (Eds.), Proceedings of the 32nd annual conference of the cognitive science society. TX: Cognitive Science Society, Austin. Available at SSRN: https://ssrn.com/abstract=1802162
Jamieson, R. K., Holmes, S., & Mewhort, D. J. K. (2010). Global similarity predicts dissociation of classification and recognition: Evidence questioning the implicit-explicit learning dissociation in amnesia. Journal of Experimental Psychology: Learning, Memory, and Cognition, 36, 1529–1535.
PubMed Google Scholar
Knowlton, B. J., & Squire, L. R. (1993). The learning of categories: Parallel brain systems for item memory and category knowledge. Science, 262, 1747–1749.
Article ADS CAS PubMed Google Scholar
Kruschke, J. K. (1992). ALCOVE: An exemplar-based connectionist model of category learning. Psychological Review, 99, 22–44.
Article CAS PubMed Google Scholar
Kurtz, K. J. (2007). The divergent autoencoder (DIVA) model of category learning. Psychonomic Bulletin & Review, 14, 560–576.
Article Google Scholar
Lin, D. L., & Little, D. R. (2023). Further tests of sequence effects in a modified Garner task using separable dimensions. Journal of Experimental Psychology: General, 152, 1080–1121.
Article PubMed Google Scholar
Little, D. R., Wang, T., & Nosofsky, R. M. (2016). Sequence-sensitive exemplar and decision-bound accounts of speeded-classification performance in a modified Garner-tasks paradigm. Cognitive Psychology, 89, 1–38.
Article PubMed Google Scholar
Nosofsky, R. M. (1986). Attention, similarity, and the identification-categorization relationship. Journal of Experimental Psychological: General, 115, 39–57.
Article CAS Google Scholar
Nosofsky, R. M., Clark, S. E., & Shin, H. J. (1989). Rules and exemplars in categorization, identification, and recognition. Journal of Experimental Psychology: Learning, Memory, and Cognition, 15, 282–304.
CAS PubMed Google Scholar
Nosofsky, R. M., & Hu, M. (2022). Generalization in distant regions of a rule-described category space: A mixed exemplar and logical-rule-based account. Computational Brain & Behavior, 5, 435–466.
Nosofsky, R. M., Little, D. R., & James, T. W. (2012). Activation in the neural network responsible for categorization and recognition reflects parameter changes. PNAS, 109, 333–338.
Article ADS CAS PubMed Google Scholar
Nosofsky, R. M., & Meagher, B. J. (2022). Retention of exemplar-specific information in learning of real-word high dimensional categories: Eviden from modeling of old-new recognition. Proceedings of the Auunal Meeting of the Cognitive Science Society, 44, 100–106.
Google Scholar
Nosofsky, R. M., Palmeri, T. J., & McKinley, S. C. (1994). Rule-plus-exception model of classification learning. Psychological Review, 101, 53–79.
Article CAS PubMed Google Scholar
Nosofsky, R. M., & Zaki, S. R. (1998). Dissociations between categorization and recognition in amnesic and normal individuals: An exemplar-based interpretation. Psychological Science, 9(4), 247–255.
Article Google Scholar
Palmeri, T. J., & Flanery, M. A. (1999). Learning about categories in the absence of traing: Profound amnesia and the relationship between perceptual categorization and recognition memory. Psychological Science, 10(6), 526–530.
Article Google Scholar
Peirce, J. W. (2007). Psychopy-psychophysics software in python. Journal of Neuroscience Methods, 162, 8–13.
Article PubMed PubMed Central Google Scholar
Rips, L. J. (1989). Similarity, typicality, and categorization. In S. Vosniadou & A. Ortony (Eds.), Cambridge University Press, New York, US
Sakamoto, Y., Jones, M., & Love, B. C. (2008). Putting the psychology back into psychological models: Mechanistic versus rational approaches. Memory & Cognition, 36(6), 1057–1065.
Article Google Scholar
Shen, J., & Palmeri, T. J. (2016). Modeling individual differences in visual categorization. Visual Cognition, 24 (3), 260–283. https://doi.org/10.1080/13506285.2016.1236053
Smith, J. D., & Minda, J. P. (2000). Thirty categorization results in search of a model. Journal of Experimental Psychology: Learning, Memory, and Cognition, 26(1), 3–27.
CAS PubMed Google Scholar
Stewart, N., & Brown, G. D. A. (2005). perceptual categorization. Journal of Mathematical Psychology, 49, 403–409.
Article MathSciNet Google Scholar
Stewart, N., Brown, G. D. A., & Chater, N. (2002). Sequence effects in categorization of simple perceptual stimuli. Journal of Experimental Psychology: Learning, Memory, and Cognition, 28(1), 3–11.
PubMed Google Scholar
Stewart, N., & Chater, N. (2002). The effect of category variability in perceptual categorization. Journal of Experimental Psychology: Learning, Memory, and Cognition, 28, 893–907.
PubMed Google Scholar
Yang, L.-X., & Huang, T.-L. (2021). Exemplar account for category variability effect: Single category based categorization. In Proceedings of the Annual Meeting of the Cog- nitive Science Society, 43, 2699–2705.
Google Scholar
Zaki, S. R., & Nosofsky, R. M. (2001). Exemplar accounts of blending and distinctiveness effects in perceptual old-new recognition. Journal of Experimental Psychology: Learning, Memory, and Cognition, 27(4), 1022–1041.
CAS PubMed Google Scholar
Zaki, S. R., & Nosofsky, R. M. (2007). A high-distortion enhancement effect in the prototype-learning paradigm: Dramatic effects of category learning during test. Memory & Cognition, 25, 3088–3096.
Google Scholar

Download references

Funding

The raw data and R codes are available at https://osf.io/t9e6k/?view_only=f729f30209694007a061d69cc9eb74fa and this study was not preregistered

Author information

Authors and Affiliations

Department of Psychology, National Chengchi Univerity, No. 64, Section 2, Zhinan Rd, Taipei City, Wenshan District, 116, Taiwan
Lee-Xieng Yang & Po-An Chiang
Research Center for Mind, Brain, and Learning, National Chengchi University, No. 64, Section 2, Zhinan Rd, Taipei, Taiwan
Lee-Xieng Yang

Authors

Lee-Xieng Yang
View author publications
You can also search for this author in PubMed Google Scholar
Po-An Chiang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lee-Xieng Yang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Computational modeling with the SDGCM

The SDGCM (Stewart & Brown, 2005) is an exemplar model, in which categorization is based on the similarity and dissimilarity of the to-be-classified item to all exemplars. As is the same as most exemplar models, the SD-GCM also assumes that each item is represented as a dot in the psychological space and the distance between items \(x_i\) and \(x_j\) is computed as \(d_{ij} = (\sum _m \alpha _m (|x_{mi} - x_{mj}|^p))^{1/q}\), where \(\alpha _m\) is the selective attention on dimension m, \(\sum _m \alpha _m = 1\). The parameters p and q determine the distance metric (e.g., \(p = q = 1\) for the city-block distance and \(p = q = 2\) for the Euclidean distance). We tried and found that the SDGCM could better generate our observed data in our two experiments when the Euclidean distance was used. The similarity between \(x_i\) and \(x_j\) is transferred from the distance between them as \(S_{ij} = e^{-cd_{ij}}\), where c is the specificity of the stimulus space; the larger c, the more specific each item is. The dissimilarity between \(x_i\) and \(x_j\) is computed as \(DS_{ij} = 1 - S_{ij}\). The probability that an item is classified as Category A is computed as \(P(A) = \frac{(\beta _A\nu _A)^\gamma }{(\beta _A\nu _A)^\gamma + (\beta _B\nu _B)^\gamma }\), where \(\nu _A\) and \(\nu _B\) are the valences for Category A and Category B, respectively, and \(\gamma \) is the decision parameter; the larger \(\gamma \), the more deterministic the categorization decision is. The bias parameters \(\beta _A\) and \(\beta _B\) were set as equal in this study, for the exemplars of the two categories were presented equally frequently.

The valence for Category A \(\nu _A\) is the sum of the total similarity of \(x_i\) to all exemplars of Category A and the total dissimilarity of it to all exemplars of Category B, as \(\nu _A = \sum _{j \in A}w_j S_{ij} + \sum _{j \in B}w_j DS_{ij}\), where \(w_j\) (\(0 \le w_j \le 1\)) is the parameter to weight the contribution of the exemplar \(x_j\) for categorization, which was treated as the accessibility of the exemplar \(x_j\) in this study. Likewise, the valence for Category B is computed as \(\nu _B = \sum _{j \in B}w_j S_{ij} + \sum _{j \in A}w_j DS_{ij}\). According to the SD-GCM, an item tends to be classified as one category if it is more similar to the exemplars of that category and more dissimilar to those of the contrasting category. In this study, it was assumed that the exemplars in the same category share the same accessibility. Thus, in “Experiment 1”, we estimated two accessibilities \(w_A\) and \(w_B\) for Category A and Category B, respectively. In “Experiment 2”, the accessibility of each exemplar was set up as 1, as there was only one category and no need to assess the exemplar accessibility of each category. Regardless of the experiment, the parameters of the SD-GCM were estimated by Bayesian inference, which was conducted by using the R package {rjags} with two chains each collecting 1000 posterior samples for each parameter and each response of the SDGCM, without setting up burn-in samples.

Bayesian modeling with the SDGCM for Experiment 1

In “Experiment 1”, four parameters, c, \(\gamma \), \(\alpha _x\), and \(\lambda \) were estimated for the data of each participant in the two conditions. The first three had uniform priors, as \(c\sim U(1,10)\), \(\gamma \sim U(1,20)\), and \(\alpha _x\sim U(0,1)\). As the data in “Experiment 2” were the probability of Category A (or 1 - Probability of Category B for Condition B) for each transfer item predicted by the participants, the likelihood for the SDGCM to generate each data point was computed by a Gaussian distribution function with the mean as the Probability of Category A computed by the SD-GCM for each transfer item and the precision as \(\lambda = 1/\sigma ^2\), where \(\sigma ^2\) is the variance of normal distribution. The precision parameter \(\lambda \) had a Gamma prior, \(\lambda \sim Gamma(0.001,0.001)\).

Bayesian modeling with the SDGCM for Experiment 2

Five parameters c, \(\gamma \), \(\alpha _x\), \(w_A\), and \(w_B\), were estimated for each participant in the XOR and Proximity groups. The first three had uniform priors as \(c\sim U(1,30)\), \(\gamma \sim U(3,20)\), and \(\alpha _x\sim U(0,1)\). The parameters \(w_A\) and \(w_B\) also had uniform priors, except that their intervals depended on the latent parameter g, which had a Bernoulli prior, \(g \sim Bern(0.5)\). When \(g = 0\) (for the XOR group), \(w_A \sim U(0.51,1)\) and \(w_B\sim U(0,0.49)\), while when \(g = 1\) (for the Proximity group), \(w_A\sim U(0,0.49)\) and \(w_B\sim U(0.51,1)\), as it was hypothesized that the XOR pattern and the Proximity pattern, respectively, result from referencing more Category A and Category B. The parameters \(w_A\) and \(w_B\) independently estimated, because it was not necessary to assume the total accessibility as a constant. The likelihood for the SDGCM to generate the observed response for each transfer item for each participant was computed by the binomial distribution function, given the probability of Category A computed by the SDGCM.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Yang, LX., Chiang, PA. Post-training flexibility in category learning. Psychon Bull Rev (2024). https://doi.org/10.3758/s13423-023-02451-7

Download citation

Accepted: 28 December 2023
Published: 05 February 2024
DOI: https://doi.org/10.3758/s13423-023-02451-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Post-training flexibility in category learning

Abstract

Access this article

Similar content being viewed by others

Generalization in Distant Regions of a Rule-Described Category Space: a Mixed Exemplar and Logical-Rule-Based Account

Does practice in category learning increase rule use or exemplar use—or both?

Individual differences in category learning: Memorization versus rule abstraction

Open Practices Statement

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Computational modeling with the SDGCM

Bayesian modeling with the SDGCM for Experiment 1

Bayesian modeling with the SDGCM for Experiment 2

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Post-training flexibility in category learning

Abstract

Access this article

Similar content being viewed by others

Generalization in Distant Regions of a Rule-Described Category Space: a Mixed Exemplar and Logical-Rule-Based Account

Does practice in category learning increase rule use or exemplar use—or both?

Individual differences in category learning: Memorization versus rule abstraction

Open Practices Statement

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Computational modeling with the SDGCM

Appendix: Computational modeling with the SDGCM

Bayesian modeling with the SDGCM for Experiment 1

Bayesian modeling with the SDGCM for Experiment 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation