A Simple Method for Generating Additive Clustering Models with Limited Complexity

Lee, Michael D.

doi:10.1023/A:1014112506867

A Simple Method for Generating Additive Clustering Models with Limited Complexity

Published: October 2002

Volume 49, pages 39–58, (2002)
Cite this article

Download PDF

Machine Learning Aims and scope Submit manuscript

A Simple Method for Generating Additive Clustering Models with Limited Complexity

Download PDF

Michael D. Lee¹

650 Accesses
9 Citations
Explore all metrics

Abstract

Additive clustering was originally developed within cognitive psychology to enable the development of featural models of human mental representation. The representational flexibility of additive clustering, however, suggests its more general application to modeling complicated relationships between objects in non-psychological domains of interest. This paper describes, demonstrates, and evaluates a simple method for learning additive clustering models, based on the combinatorial optimization approach known as Population-Based Incremental Learning. The performance of this new method is shown to be comparable with previously developed methods over a set of ‘benchmark’ data sets. In addition, the method developed here has the potential, by using a Bayesian analysis of model complexity that relies on an estimate of data precision, to determine the appropriate number of clusters to include in a model.

References

Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716-723.
Google Scholar
Arabie, P., & Carroll, J. D. (1980). MAPCLUS: A mathematical programming approach to fitting the ADCLUS model. Psychometrika, 45:2, 211-235.
Google Scholar
Arabie, P., Carroll, J. D., & DeSarbo, W. S. (1987). Three-way scaling and clustering. Newbury Park, CA: Sage.
Google Scholar
Arabie, P., & Shepard, R. N. (1973). Representation of similarities as combinations of discrete, overlapping properties. In Mathematical Psychological Meeting, Montréal.
Baluja, S. (1994). Population-based incremental learning: A method for integrating genetic search based function optimization and competitive learning. Technical Report CMU-CS-94-163, Carnegie-Mellon University.
Baluja, S., & Davies, S. (1997). Using optimal dependency-trees for combinatorial optimization: Learning the structure of the search space. Technical Report CS-CMU-97-107, Carnegie-Mellon University.
Baluja, S., & Davies, S. (1998). Fast probabilistic modeling for combinatorial optimization. In Proceedings of the Fifteenth National Conference on Artificial Intelligence (pp. 469-476). Madison, WI: AAAI Press/MIT Press.
Google Scholar
Boyan, J. A., & Moore, A. W. (1998). Learning evaluation function for global optimization and boolean satisfiability. In Proceedings of the Fifteenth National Conference on Artificial Intelligence (pp. 3-10). Madison, WI: AAAI Press/MIT Press.
Google Scholar
Breiger, R. L., Boorman, S. A., & Arabie, P. (1975). An algorithm for clustering relational data with applications to social network analysis and comparison with multidimensional scaling. Journal of Mathematical Psychology, 12, 328-383.
Google Scholar
Chaturvedi, A., & Carroll, J. D. (1994).Analternating combinatorial optimization approach to fitting theINDCLUS and generalized INDCLUS models. Journal of Classification, 11, 155-170.
Google Scholar
Cox, T. F., & Cox, M. A. A. (1994). Multidimensional scaling. London: Chapman and Hall.
Google Scholar
Ekman, G. (1954). Dimensions of color vision. The Journal of Psychology, 38, 467-474.
Google Scholar
Gati, I., & Tversky, A. (1982). Representations of qualitative and quantitative dimensions. Journal of Experimental Psychology: Human Perception and Performance, 8:2, 325-340.
Google Scholar
Gregson, R. A. M. (1976). A comparative evaluation of seven similarity models. British Journal of Mathematical and Statistical Psychology, 29, 139-156.
Google Scholar
Grünwald, P. (2000). Model selection based on minimum description length. Journal of Mathematical Psychology, 44:1, 133-152.
Google Scholar
Hertz, J., Krogh, A., & Palmer, R. G. (1991). Introduction to the theory of neural computing. Redwood City, CA: Addison-Wesley.
Google Scholar
Hojo, H. (1982).Amaximumlikelihood method for additive clustering and its applications. Japanese Psychological Research, 25:4, 191-201.
Google Scholar
Johnson, E. J., & Tversky, A. (1984). Representations of perceptions of risks. Journal of Experimental Psychology: General, 113:1, 55-70.
Google Scholar
Kass, R. E., & Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90:430, 773-795.
Google Scholar
Kruschke, J. K. (1993). Human category learning: Implications for backpropagation models. Connection Science, 5, 3-36.
Google Scholar
Lawson, C. L., & Hanson, R. J. (1974). Solving least squares problems. Englewood Cliffs, NJ: Prentice-Hall.
Google Scholar
Lee, M. D. (1998). Active cognitive representation and learned categorical perception. In 25th Australasian Experimental Psychology Conference, Hobart.
Lee, M. D. (1999a). Algorithms for representing similarity data. Defence Science and Technology Organisation Research Report DSTO-RR-0152.
Lee, M. D. (1999b). An extraction and regularization approach to additive clustering. Journal of Classification, 16:2, 255-281.
Google Scholar
Lee, M. D. (2001a). Determining the dimensionality of multidimensional scaling representations for cognitive modeling. Journal of Mathematical Psychology, 45:1, 149-166.
Google Scholar
Lee, M. D. (2001b). On the complexity of additive clustering models. Journal of Mathematical Psychology, 45:1, 131-148.
Google Scholar
Miller, G. A., & Nicely, P. E. (1955). An analysis of perceptual confusions among some English consonants. Journal of the Acoustical Society of America, 27, 338-352.
Google Scholar
Mirkin, B. G. (1987). Additive clustering and qualitative factor analysis methods for similarity matrices. Journal of Classification, 4, 7-31.
Google Scholar
Myung, I. J., Balasubramanian, V., & Pitt, M. A. (2000). Counting probability distributions: Differential geometry and model selection. In Proceedings of the National Academy of Sciences, 97, 11170-11175.
Google Scholar
Myung, I. J., & Pitt, M. A. (1997). Applying Occam's razor in modeling cognition: A Bayesian approach. Psychonomic Bulletin & Review, 4:1, 79-95.
Google Scholar
Raftery, A. E. (1999). Bayes factors and BIC: Comment on Weakliem. Sociological Methods and Research, 27, 411-427.
Google Scholar
Roethlisberger, F. J., & Dickson, W. J. (1939). Management and the worker. Cambridge, MA: Harvard University Press.
Google Scholar
Rosenberg, S., & Kim, M. P. (1975). The method of sorting as a data-generating procedure in multivariate research. Multivariate Behavioral Research, 10, 489-502.
Google Scholar
Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6:2, 461-464.
Google Scholar
Shepard, R. N. (1974). Representation of structure in similarity data: Problems and prospects. Psychometrika, 39:4, 373-422.
Google Scholar
Shepard, R. N., & Arabie, P. (1979). Additive clustering representations of similarities as combinations of discrete overlapping properties. Psychological Review, 86:2, 87-123.
Google Scholar
Shepard, R. N., Kilpatrick, D.W., & Cunningham, J. P. (1975). The internal representation of numbers. Cognitive Psychology, 7, 82-138.
Google Scholar
Tenenbaum, J. B. (1996). Learning the structure of similarity. In D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Advances in neural information processing systems (Vol. 2). Cambridge, MA: MIT Press.
Google Scholar
Woodruff, C. J. (1998). Establishing a psychophysics of texture. In 25th Australasian Experimental Psychology Conference, Hobart.

Download references

Author information

Authors and Affiliations

Department of Psychology, University of Adelaide, SA, 5005, Australia
Michael D. Lee

Authors

Michael D. Lee
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lee, M.D. A Simple Method for Generating Additive Clustering Models with Limited Complexity. Machine Learning 49, 39–58 (2002). https://doi.org/10.1023/A:1014112506867

Download citation

Issue Date: October 2002
DOI: https://doi.org/10.1023/A:1014112506867

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Simple Method for Generating Additive Clustering Models with Limited Complexity

Abstract

Article PDF

Similar content being viewed by others

A Heuristic Automatic Clustering Method Based on Hierarchical Clustering

Holistic Assessment of Structure Discovery Capabilities of Clustering Algorithms

Hierarchical Clustering for Large Data Sets

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

A Simple Method for Generating Additive Clustering Models with Limited Complexity

Abstract

Article PDF

Similar content being viewed by others

A Heuristic Automatic Clustering Method Based on Hierarchical Clustering

Holistic Assessment of Structure Discovery Capabilities of Clustering Algorithms

Hierarchical Clustering for Large Data Sets

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation