Abstract
Interval-valued data arise in practical situations such as recording monthly interval temperatures at meteorological stations or daily interval stock prices. This paper compares the clustering efficiency, measured by the adjusted Rand index, of spectral, ensemble, and spectral-mean shift clustering methods for symbolic data. Evaluation studies on artificial data with known cluster structure (obtained from the mlbench and clusterSim packages of R) show the usefulness and stability of ensemble clustering compared with the spectral and spectral-mean shift methods.
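The adjusted Rand index used as the efficiency criterion can be illustrated with a minimal sketch. The code below is not the author's implementation (the study relies on R packages); it is a plain-Python version of the standard Hubert and Arabie formula, and the function name `adjusted_rand_index` and the toy label vectors are hypothetical, chosen only for illustration.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two partitions of the same objects.

    Illustrative sketch only: follows the standard Hubert-Arabie
    formula, not any code from the paper.
    """
    if len(labels_true) != len(labels_pred):
        raise ValueError("both label vectors must have the same length")

    n = len(labels_true)
    total_pairs = comb(n, 2)
    if total_pairs == 0:
        # Fewer than two objects: the partitions trivially agree.
        return 1.0

    # Contingency table cells and its row/column margins.
    cells = Counter(zip(labels_true, labels_pred))
    rows = Counter(labels_true)
    cols = Counter(labels_pred)

    # Number of object pairs placed together in both / each partition.
    sum_cells = sum(comb(c, 2) for c in cells.values())
    sum_rows = sum(comb(c, 2) for c in rows.values())
    sum_cols = sum(comb(c, 2) for c in cols.values())

    expected = sum_rows * sum_cols / total_pairs  # agreement expected by chance
    max_index = (sum_rows + sum_cols) / 2         # upper bound of the index

    if max_index == expected:
        # Degenerate case (e.g. both partitions have a single cluster).
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


# Hypothetical toy labels: the known structure versus two clustering results.
known = [0, 0, 1, 1]
same_up_to_renaming = [1, 1, 0, 0]
crossed = [0, 1, 0, 1]

ari_same = adjusted_rand_index(known, same_up_to_renaming)
ari_crossed = adjusted_rand_index(known, crossed)
```

The index equals 1 when a clustering reproduces the known structure up to a renaming of cluster labels, and it falls to around 0 or below when the agreement is no better than chance, which is why it suits comparisons against artificial data with known cluster structure.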
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Pełka, M. (2016). A Comparison Study for Spectral, Ensemble and Spectral-Mean Shift Clustering Approaches for Interval-Valued Symbolic Data. In: Wilhelm, A., Kestler, H. (eds) Analysis of Large and Complex Data. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-25226-1_12
DOI: https://doi.org/10.1007/978-3-319-25226-1_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25224-7
Online ISBN: 978-3-319-25226-1
eBook Packages: Mathematics and Statistics (R0)