The Block Generative Topographic Mapping

  • Rodolphe Priam
  • Mohamed Nadif
  • Gérard Govaert
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5064)

Abstract

This paper presents a generative model and its estimation allowing to visualize binary data. Our approach is based on the Bernoulli block mixture model and the probabilistic self-organizing maps. This leads to an efficient variant of Generative Topographic Mapping. The obtained method is parsimonious and relevant on real data.

Keywords

Mixture Model Binary Data Latent Variable Model Mixture Approach Block Cluster 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Lebart, L., Morineau, A., Warwick, K.: Multivariate Descriptive Statistical Analysis. J. Wiley, Chichester (1984)MATHGoogle Scholar
  2. 2.
    Kohonen, T.: Self-organizing maps. Springer, Heidelberg (1997)MATHGoogle Scholar
  3. 3.
    Bishop, C.M., Svensén, M., Williams, C.K.I.: Developpements of generative topographic mapping. Neurocomputing 21, 203–224 (1998)MATHCrossRefGoogle Scholar
  4. 4.
    Dempster, A., Laird, N., Rubin, D.: Maximum-likelihood from incomplete data via the EM algorithm. J. Royal Statist. Soc. Ser. B. 39, 1–38 (1977)MATHMathSciNetGoogle Scholar
  5. 5.
    Priam, R., Nadif, M.: Carte auto-organisatrice probabiliste sur données binaires (in french). In: RNTI (EGC 2006 proceedings), pp. 445–456 (2006)Google Scholar
  6. 6.
    Bock, H.: Simultaneous clustering of objects and variables. In: Diday, E. (ed.) Analyse des Données et Informatique, INRIA, pp. 187–203 (1979)Google Scholar
  7. 7.
    Govaert, G.: Classification croisée. In: Thèse d’état, Université Paris 6, France (1983)Google Scholar
  8. 8.
    Govaert, G.: Simultaneous clustering of rows and columns. Control and Cybernetics 24(4), 437–458 (1995)MATHGoogle Scholar
  9. 9.
    Cottrell, M., Ibbou, S., Letrémy, P.: Som-based algorithms for qualitative variables. Neural Networks 17(89), 1149–1167 (2004)MATHCrossRefGoogle Scholar
  10. 10.
    Symons, M.J.: Clustering criteria and multivariate normal mixture. Biometrics 37, 35–43 (1981)MATHCrossRefMathSciNetGoogle Scholar
  11. 11.
    McLachlan, G.J., Basford, K.E.: Mixture Models, Inference and applications to clustering. Marcel Dekker, New York (1988)MATHGoogle Scholar
  12. 12.
    McLachlan, G.J., Peel, D.: Finite Mixture Models. John Wiley and Sons, New York (2000)MATHGoogle Scholar
  13. 13.
    Govaert, G., Nadif, M.: Clustering with block mixture models. Pattern Recognition 36, 463–473 (2003)CrossRefGoogle Scholar
  14. 14.
    Govaert, G., Nadif, M.: An EM algorithm for the block mixture model. IEEE Trans. Pattern Anal. Mach. Intell. 27(4), 643–647 (2005)CrossRefGoogle Scholar
  15. 15.
    Govaert, G., Nadif, M.: Block clustering with bernoulli mixture models: Comparison of different approaches. Computational Statistics & Data Analysis 52, 3233–3245 (2008)CrossRefGoogle Scholar
  16. 16.
    MacKay, D.J.C.: Bayesian interpolation. Neural Computation 4(3), 415–447 (1992)CrossRefGoogle Scholar
  17. 17.
    McCullagh, P., Nelder, J.: Generalized linear models. Chapman and Hall, London (1983)MATHGoogle Scholar
  18. 18.
    Benzecri, J.P.: Correspondence Analysis Handbook. Dekker, New-York (1992)MATHGoogle Scholar
  19. 19.
    Girolami, M.: The topographic organization and visualization of binary data using multivariate-bernoulli latent variable models. IEEE Transactions on Neural Networks 20(6), 1367–1374 (2001)CrossRefGoogle Scholar
  20. 20.
    Kabán, A., Girolami, M.: A combined latent class and trait model for analysis and visualisation of discrete data. IEEE Trans. Pattern Anal. and Mach. Intell., 859–872 (2001)Google Scholar
  21. 21.
    Dhillon, I.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Seventh ACM SIGKDD Conference, San Francisco, California, USA, pp. 269–274 (2001)Google Scholar
  22. 22.
    Hofmann, T.: Probmap - a probabilistic approach for mapping large document collections. Intell. Data Anal. 4(2), 149–164 (2000)MATHGoogle Scholar
  23. 23.
    Kaban, A.: A scalable generative topographic mapping for sparse data sequences. In: ITCC 2005: Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC 2005), Washington, DC, USA, vol. I, pp. 51–56. IEEE Computer Society, Los Alamitos (2005)Google Scholar
  24. 24.
    Kaban, A.: Predictive modelling of heterogeneous sequence collections by topographic ordering of histories. Machine Learning 68(1), 63–95 (2007)CrossRefGoogle Scholar
  25. 25.
    Priam, R.: CASOM: Som for contingency tables and biplot. In: 5th Workshop on Self-Organizing Maps (WSOM 2005), pp. 379–385 (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Rodolphe Priam
    • 1
  • Mohamed Nadif
    • 2
  • Gérard Govaert
    • 3
  1. 1.LMA Poitiers UMR 6086Université de PoitiersFuturoscope ChasseneuilFrance
  2. 2.UFR de Mathématiques et InformatiqueUniversité Paris DescartesParisFrance
  3. 3.Heudiasyc UMR 6599UTCCompiègneFrance

Personalised recommendations