Assessment of an Unsupervised Feature Selection Method for Generative Topographic Mapping

  • Alfredo Vellido
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4132)


Feature selection (FS) has long been studied in classification and regression problems. In comparison, FS for unsupervised learning has received far less attention. For many real problems concerning unsupervised data clustering, FS becomes an issue of paramount importance. An unsupervised FS method for Gaussian Mixture Models, based on Feature Relevance Determination (FRD), was recently defined. Unfortunately, the data visualization capabilities of general mixture models are limited. Generative Topographic Mapping (GTM), a constrained mixture model, was originally defined to overcome such limitation. In this brief study, we test in some detail the capabilities of a recently described FRD method for GTM that allows the clustering results to be intuitively visualized and interpreted in terms of a reduced subset of selected relevant features.


Feature Selection Mixture Model Gaussian Mixture Model Finite Mixture Model Adaptive Parameter 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    McLachlan, G.J., Peel, D.: Finite Mixture Models. John Wiley & Sons, New York (2000)MATHCrossRefGoogle Scholar
  2. 2.
    Wong, P.C.: Visual data mining. IEEE Comput. Graph 19(5), 20–21 (1999)CrossRefGoogle Scholar
  3. 3.
    Bishop, C.M., Svensén, M., Williams, C.K.I.: GTM: The Generative Topographic Mapping. Neural Comput. 10(1), 215–234 (1998)CrossRefGoogle Scholar
  4. 4.
    Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Berlin (2000)Google Scholar
  5. 5.
    Law, M.H.C., Figueiredo, M.A.T., Jain, A.K.: Simultaneous Feature Selection and Clustering Using Mixture Models. IEEE T. Pattern Anal. 26(9), 1154–1166 (2004)CrossRefGoogle Scholar
  6. 6.
    Vellido, A., Lisboa, P.J.G., Vicente, D.: Robust Analysis of MRS Brain Tumour Data Using t-GTM. Neurocomputing 69(7-9), 754–768 (2006)CrossRefGoogle Scholar
  7. 7.
    Dempster, A.P., Laird, M.N., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. J. Roy. Stat. Soc. B 39(1), 1–38 (1977)MATHMathSciNetGoogle Scholar
  8. 8.
    MacKay, D.J.C.: Bayesian Methods for Back-Propagation Networks. In: Domany, E., van Hemmen, J.L., Schulten, K. (eds.) Models of Neural Networks III, pp. 211–254. Springer, New York (1994)Google Scholar
  9. 9.
    Andrade, A., Vellido, A.: Determining Feature Relevance for the Grouping of Motor Unit Action Potentials through Generative Topographic Mapping. In: Proc. of the 25th IASTED International Conference Modelling, Identification, and Control (MIC 2006), pp. 507–512 (2006)Google Scholar
  10. 10.
    Dy, J.G., Brodley, C.E.: Feature Selection for Unsupervised Learning. J. Mach. Learn. Res. 5, 845–889 (2004)MathSciNetGoogle Scholar
  11. 11.
    Dash, M., Liu, H., Yao, J.: Dimensionality Reduction for Unsupervised Data. In: Proc. Of the 9th Int. Conf. on Tools with Artificial Intelligence (TAI 1997), pp. 532–539 (1997)Google Scholar
  12. 12.
    Hunter, A.: Feature Selection Using Probabilistic Neural Networks. Neural Comput. Appl. 9(2), 124–132 (2000)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Alfredo Vellido
    • 1
  1. 1.Department of Computing Languages and Systems (LSI)Polytechnic University of Catalonia (UPC)BarcelonaSpain

Personalised recommendations