Data Mining and Knowledge Discovery, Volume 21, Issue 3, pp 398–426

A weighted voting summarization of SOM ensembles

  • Bruno Baruque
  • Emilio Corchado
Article

Abstract

Weighted Voting Superposition is a novel algorithm for summarizing the results of an ensemble of Self-Organizing Maps. Its principal aim is to achieve the lowest topographic error in the resulting map, in order to obtain the best possible visualization of the internal structure of the data sets under study. This is done by means of a weighted voting process among the neurons of the ensemble maps, which determines the characteristics of the neurons in the resulting map. The algorithm is applied in this case to the most widely known topology-preserving mapping architecture: the Self-Organizing Map. The novel fusion algorithm presented in this work is compared with previously devised fusion algorithms, along with a new variation of those algorithms called Ordered Similarity. Although a practical example of the new algorithm was introduced in an earlier work, a rigorous description and analysis is presented here for the first time, comparing the performance of the aforementioned algorithms on three well-known data sets (Iris, Wisconsin Breast Cancer and Wine) obtained from Internet repositories. The results show that this novel fusion algorithm outperforms the other fusion algorithms, yielding better visualization results for ensemble summarization of maps.
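
As an illustration of the quality measure the abstract targets, the following is a minimal sketch of the topographic error as it is commonly defined: the fraction of samples whose best and second-best matching units are not neighbours on the map grid. It is not taken from the article. The function name `topographic_error`, the rectangular grid with row-major unit indexing, and the 4-neighbour adjacency rule are all assumptions made for this example.

```python
import numpy as np


def topographic_error(data, weights, grid_shape):
    """Fraction of samples whose two closest units are not grid neighbours.

    Assumptions (illustrative, not from the article):
      - data has shape (n_samples, n_features)
      - weights has shape (rows * cols, n_features), units stored row-major
      - two units are neighbours if they are 1 step apart horizontally
        or vertically on a rectangular grid
    """
    rows, cols = grid_shape
    if weights.shape[0] != rows * cols:
        raise ValueError("weights must contain rows * cols units")

    # Euclidean distance from every sample to every unit: (n_samples, n_units)
    dists = np.linalg.norm(data[:, None, :] - weights[None, :, :], axis=2)

    # Indices of the best and second-best matching unit for each sample
    order = np.argsort(dists, axis=1)
    bmu, second = order[:, 0], order[:, 1]

    # Convert flat unit indices to (row, col) grid coordinates
    r1, c1 = np.divmod(bmu, cols)
    r2, c2 = np.divmod(second, cols)

    # Adjacent when the Manhattan distance on the grid is exactly 1
    adjacent = (np.abs(r1 - r2) + np.abs(c1 - c2)) == 1
    return float(np.mean(~adjacent))


if __name__ == "__main__":
    # Toy 1 x 3 map over one-dimensional data
    samples = np.array([[0.1], [1.9]])
    ordered_map = np.array([[0.0], [1.0], [2.0]])
    folded_map = np.array([[0.0], [2.0], [1.0]])
    print(topographic_error(samples, ordered_map, (1, 3)))
    print(topographic_error(samples, folded_map, (1, 3)))
```

A lower value indicates that units which are close in the input space are also close on the grid, which is the property a summarized map would be expected to preserve.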

Keywords

  • Data visualization
  • Topology preservation
  • Ensemble learning
  • Self-organizing maps

Copyright information

© The Author(s) 2010

Authors and Affiliations

  1. Department of Civil Engineering, University of Burgos, Burgos, Spain
  2. Departamento de Informática y Automática, Universidad de Salamanca, Salamanca, Spain
