Abstract
An overview over a new release of the statistical software ClusCorr98 will be given. The emphasis of this software lies on an extended collection of exploratory and model-based clustering techniques with in-built validation via resampling. Using special weights of observations leads to well-known resampling techniques. By doing so, the appropriate number of clusters can be validated. As an illustration of an interesting feature of ClusCorr98, a general validation of results of hierarchical clustering based on the adjusted Rand index is recommended. It is applied to demographical data from economics. Here the stability of each cluster can be assessed additionally.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BANFIELD, J.D. and RAFTERY, A.E. (1993): Model-Based Gaussian and non-Gaussian Clustering. Biometrics, 49, 803–821.
CIA World Factbook (1999): Population by Country. http://www.geographic.org.
FRALEY, C. (1996): Algorithms for model-based Gaussian Hierarchical Clustering. Technical Report, 311. Department of Statistics, University of Washington, Seattle.
GORDON, A.D. (1999): Classification. Chapman & Hall/CRC, London.
GOWER, J.C. (1971): A General Coefficient of Similarity and some of its Properties. Biometrics, 27, 857–874.
HUBERT, L.J. and ARABIE, P. (1985): Comparing Partitions. Journal of Classification, 2, 193–218.
JAIN, A.K. and DUBES, R.C. (1988): Algorithms for Clustering Data. Prentice Hall, New Jersey.
KAUFMAN, L. and ROUSSEEUW, P.J. (1990): Finding Groups in Data. Wiley, New York.
MUCHA, H.-J. (1992): Clusteranalyse mit Mikrocomputern. Akademie Verlag, Berlin.
MUCHA, H.-J., BARTEL, H.-G., and DOLATA, J. (2002a): Exploring Roman Brick and Tile by Cluster Analysis with Validation of Results. In: W. Gaul and G. Ritter (Eds.): Classification, Automation, and New Media. Springer, Heidelberg, 471–478.
MUCHA, H.-J., BARTEL, H.-G., and DOLATA, J. (2003): Core-based Clustering Techniques. In: M. Schader, W. Gaul, and M. Vichi (Eds.): Between Data Science and Applied Data Analysis. Springer, Berlin, 74–82.
MUCHA, H.-J, SIMON, U, and BRÜGGEMANN, R. (2002b): Model-based Cluster Analysis Applied to Flow Cytometry Data of Phytoplankton. Weierstraß Institute for Applied Analysis and Stochastic, Technical Report No. 5. http://www.wias-berlin.de/.
RAND, W.M. (1971): Objective Criteria for the Evaluation of Clustering Methods. Journal of the American Statistical Association, 66, 846–850.
WARD, J.H. (1963): Hierarchical Grouping Methods to Optimise an Objective Function. JASA, 58, 235–244.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Mucha, HJ., Bartel, HG. (2005). ClusCorr98 - Adaptive Clustering, Multivariate Visualization, and Validation of Results. In: Baier, D., Wernecke, KD. (eds) Innovations in Classification, Data Science, and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-26981-9_6
Download citation
DOI: https://doi.org/10.1007/3-540-26981-9_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23221-6
Online ISBN: 978-3-540-26981-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)