Applying the Kohonen Self-Organizing Map Networks to Select Variables
Variable selection appears to be a key issue in the classification of multidimensional objects. An optimal feature set should contain only those variables that are essential for differentiating the studied objects. A graphic analysis of a U-matrix can make this selection easier, because it makes it easy to identify variables that do not differentiate the studied objects. Graphic analysis may not suffice, however, when an object is described by hundreds of variables. The authors propose a procedure that eliminates the variables with the smallest discriminating potential, based on a measurement of the concentration of objects on Kohonen self-organizing map networks.
Keywords: Group Structure; Graphic Analysis; Concentration Index; Select Variable; Silhouette Index
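The elimination idea outlined in the abstract can be illustrated with a rough sketch. The abstract does not define the concentration measure or the exact elimination rule, so everything below is an assumption made for illustration: the tiny NumPy SOM, the Herfindahl-type index of best-matching-unit hit shares used as a stand-in for the concentration measurement, and the hypothetical helper names `train_som`, `concentration` and `rank_variables`. It is not the authors' procedure.

```python
import numpy as np

def train_som(data, rows=4, cols=4, epochs=20, seed=0):
    """Train a small rectangular SOM; returns weights of shape (rows*cols, n_features)."""
    rng = np.random.default_rng(seed)
    n = data.shape[0]
    # Initialise unit weights from randomly chosen observations.
    weights = data[rng.integers(0, n, rows * cols)].astype(float)
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    total, step = epochs * n, 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            frac = step / total
            lr = 0.5 * (1.0 - frac)                                # decaying learning rate
            sigma = max(rows, cols) / 2.0 * (1.0 - frac) + 0.5     # shrinking neighbourhood
            bmu = np.argmin(((weights - data[i]) ** 2).sum(axis=1))
            dist2 = ((grid - grid[bmu]) ** 2).sum(axis=1)
            h = np.exp(-dist2 / (2.0 * sigma ** 2))                # Gaussian neighbourhood
            weights += lr * h[:, None] * (data[i] - weights)
            step += 1
    return weights

def concentration(data, weights):
    """ASSUMED measure: Herfindahl-type index of BMU hit shares, rescaled to [0, 1].

    0 means objects are spread evenly over all units; 1 means all objects hit one unit.
    """
    d2 = ((data[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2)
    bmus = d2.argmin(axis=1)
    k = len(weights)
    shares = np.bincount(bmus, minlength=k) / len(data)
    return (np.sum(shares ** 2) - 1.0 / k) / (1.0 - 1.0 / k)

def rank_variables(data, **som_kwargs):
    """For each variable j, the concentration on a SOM trained with j left out."""
    scores = {}
    for j in range(data.shape[1]):
        reduced = np.delete(data, j, axis=1)
        scores[j] = concentration(reduced, train_som(reduced, **som_kwargs))
    return scores
```

Calling `rank_variables(X, rows=4, cols=4)` on a standardized data matrix `X` returns one score per variable. Comparing these leave-one-out scores is one possible way to flag candidate variables for elimination; how the scores should be interpreted depends on the concentration measure the paper actually uses.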