Abstract
This paper investigates the possibility of supplementing standard dimensionality reduction procedures, used in knowledge extraction from multidimensional datasets, with topology preservation measures. The approach is based on the observation that not all elements of the initial dataset are equally well preserved in its low-dimensional embedding. The contribution first reviews existing topology preservation measures and then discusses their inclusion in classical methods of exploratory data analysis. Finally, illustrative examples of the presented approach in cluster analysis and classification tasks are given.
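To make the idea of unequal preservation concrete, the following is a minimal sketch of one simple per-element measure: the fraction of each point's k nearest neighbours in the original space that remain among its k nearest neighbours in the embedding. This is an illustrative assumption, not necessarily one of the measures used in the paper; the function names `knn_indices` and `neighborhood_preservation`, the choice of k, and the toy "embedding" (keeping the first two coordinates) are all hypothetical.

```python
import numpy as np


def knn_indices(X, k):
    """Return, for every row of X, the indices of its k nearest neighbours."""
    # Pairwise squared Euclidean distances via broadcasting.
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.sum(diff ** 2, axis=2)
    # A point is not counted as its own neighbour.
    np.fill_diagonal(d2, np.inf)
    return np.argsort(d2, axis=1)[:, :k]


def neighborhood_preservation(X_high, X_low, k=5):
    """Per-point share of k nearest neighbours kept by the embedding (0 to 1)."""
    nn_high = knn_indices(X_high, k)
    nn_low = knn_indices(X_low, k)
    return np.array(
        [len(set(a) & set(b)) / k for a, b in zip(nn_high, nn_low)]
    )


# Toy usage: a crude "embedding" that keeps only the first two coordinates.
rng = np.random.default_rng(0)
X_high = rng.normal(size=(30, 5))
X_low = X_high[:, :2]
scores = neighborhood_preservation(X_high, X_low, k=4)
print(scores.min(), scores.mean(), scores.max())
```

A score of 1 means the point's local neighbourhood is fully retained, while lower values flag elements whose neighbourhood was distorted by the reduction.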
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Łukasik, S., Kulczycki, P. (2013). Using Topology Preservation Measures for Multidimensional Intelligent Data Analysis in the Reduced Feature Space. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2013. Lecture Notes in Computer Science, vol 7895. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38610-7_18
DOI: https://doi.org/10.1007/978-3-642-38610-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38609-1
Online ISBN: 978-3-642-38610-7
eBook Packages: Computer Science, Computer Science (R0)