Abstract
The aim of this work is to develop a visualization method for multilevel–multigroup analysis based on a multiway nonlinear dimensionality reduction. The task of the method is to visualize what kinds of members each group is composed and to visualize the similarity between the groups in terms of probability distribution of constituent members. To achieve the task, the proposed method consists of hierarchically coupled tensor self-organizing maps, corresponding to the member/group level. This architecture enables more flexible analysis than ordinary parametric multilevel analysis, as it retains a high level of interpretability supported by strong visualization. We applied the proposed method to one benchmark dataset and two practical datasets: one is the survey data on the football players belonging to different teams and the other is the employee survey data belonging to different departments in a company. Our method successfully visualizes the types of the members that constitute each group as well as visualizes the differences or similarities between the groups.
Similar content being viewed by others
Notes
Actually \(f_{lm}\) becomes a scalar here, but we call it a reference vector for consistent explanation.
50F and 50M include 60s people.
References
Argyriou A, Evgeniou T, Pontil M (2007) Multi-task feature learning. In: Proceedings of the 20th annual conference on neural information processing systems, pp 41–48
Bishop CM, Svensen M, Williams CKI (1998) GTM: the generative topographic mapping. Neural Comput 10:215–234
Cao N, Cui W (2016) Introduction to text visualization techniques, vol 1. Atlantis Briefs in Artificial Intelligence, pp 11–40. doi:10.2991/978-94-6239-186-4
Caruana R (1997) Multitask learning. Mach Learn 28:41–75
Ceulemans E, Wilderjans TF, Kiers HAL, Timmerman ME (2016) Multilevel simultaneous component analysis: a computational shortcut and software package. Behav Res Methods 48:1008–1020. doi:10.3758/s13428-015-0626-8
Csurka G, Bray C, Dance C, Fan L (2004) Visual categorization with bags of keypoints. In: ECCV international workshop on statistical learning in computer vision, pp 1–22
de Leeuw J, Meijer E (2008) Handbook of multilevel analysis. Springer, New York
De Roover K, Timmerman ME, Mesquita B, Ceulemans E (2013) Common and cluster-specific simultaneous component analysis. PLoS ONE 8:e62280. doi:10.1371/journal.pone.0062280
De Roover K, Ceulemans E, Giordani P (2016) Overlapping clusterwise simultaneous component analysis. Chemom Intell Lab Syst 156:249–259
Eslami A, Qannari EM, Kohler A, Bougeard S (2013) General overview of methods of analysis of multi-group datasets. Revue des Nouvelles Technologies de l fInformation 25:108–123
Eslamia A, Qannarib EM, Kohlerd A, Bougeard S (2014) Algorithms for multi-group PLS. J Chemom 28:192–201. doi:10.1002/cem.2593
Evgeniou T, Pontil M (2004) Regularized multi-task learning. In: Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining, pp 109–117
French BF, Finch WH (2008) Multigroup confirmatory factor analysis: locating the invariant referent sets. Struct Equ Model 15:96–113
Friedman JH (1987) Exploratory projection pursuit. J Am Stat Assoc 82:259–266
Hu L, Cao J, Xu G, Wang J, Gu Z, Cao L (2013) Cross-domain collaborative filtering via bilinear multilevel analysis. In: Proceedings of the twenty-third international joint conference on artificial intelligence. AAAI Press, Beijing, pp 2626–2632
Ishibashi H, Furukawa T (2016) Multilevel–multigroup analysis by hierarchical tensor SOM network. In: Proceedings of ICONIP, pp 459–466
Iwasaki T, Furukawa T (2016) Tensor SOM and Tensor GTM: nonlinear tensor analysis by topographic mappings. Neural Netw 77:107–125
Iwata T, Saito K, Ueda N, Stromsten S, Griffiths TL, Tenenbaum JB (2007) Parametric embedding for class visualization. Neural Comput 19:2536–2556
Iwata T, Yamada T, Ueda N (2008) Probabilistic latent semantic visualization: topic model for visualizing documents. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp 363–371. doi:10.1145/1401890.1401937
Kamishima T, Kazawa H, Akaho S (2010) A survey and empirical comparison of object ranking methods. In: Fürnkranz J, Hüllermeier E (eds) Preference learning. Springer, Berlin, pp 181–201
Kohonen T (1982) Self-organized formation of topologically correct feature maps. Biol Cybern 43(1):59–69
Kolda TG, Bader BW (2009) Tensor decompositions and applications. SIAM Rev 51(3):455–500
Kwok JTY, Tsang IWH (2004) The pre-image problem in kernel methods. IEEE Trans Neural Netw 15(6):1517–1525. doi:10.1109/tnn.2004.837781. http://www.hpl.hp.com/conferences/icml2003/papers/345.pdf
Lawrence ND (2004) Gaussian process latent variable models for visualisation of high dimensional data. In: Advances in neural information processing systems. Neural information processing systems foundation
Lawrence ND (2005) Probabilistic non-linear principal component analysis with Gaussian process. J Mach Learn Res 6:1783–1816
Le T, Lauw HW (2015) Semantic visualization for spherical representation. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp 1007–1016
Nguyen V, Phung DQ, Nguyen X, Venkatesh S, Bui HH (2014) Bayesian nonparametric multilevel clustering with group-level contexts. In: Proceedings of international conference on machine learning (ICML), pp 288–296
Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22:1345–1359
Schouteden M, Van Deun K, Wilderjans TF, Van Mechelen I (2013) Performing DISCO-SCA to search for distinctive and common information in linked data. Behav Res Methods 46:576–587
Stefanovic P, Kurasova O (2011) Visual analysis of self-organzing maps. Nonlinear Anal Model Control 16(4):488–504
Ten Berge JMF, Kiers HAL, Van der Stel V (2011) A clusterwise simultaneous component method for capturing within-cluster differences in component variances and correlations. Br J Math Stat Psychol 66:81–102. doi:10.1111/j.2044-8317.2012.02040.x
Ten Berge JMF, Kiers HAL, Van der Stel V (2013) Simultaneous components analysis. Stat Appl 4:377–392
Timmerman ME (2006) Multilevel component analysis. Br J Math Stat Psychol 59:301–320
Tsai CF (2012) Bag-of-words representation in image annotation: a review. In: Proceedings of the 9th European conference on computer vision
Ultsch A, Siemon HP (1990) Kohonen’s self organizing feature maps for exploratory data analysis. In: Proceedings INNC’90, international neural network conference, pp 305–308
Wulsin D, Jensen S, Litt B (2012) A hierarchical dirichlet process model with multiple levels of clustering for human eeg seizure modeling. In: Proceedings of the 29th international conference on machine learning
Zhou G, Cichocki A, Zhang Y, Mandic D (2015) Group component analysis for multiblock data: common and individual feature extraction. IEEE Trans Neural Netw Learn Syst 27:2426–2439
Acknowledgements
We would like to thank Prof. Isogai and Dr. Shinriki, Kyushu Institute of Technology, for allowing us to use football player data. We also would like to thank Dr. Iwasaki for much assistance in developing analysis tools and useful discussion.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported by Okawa Foundation for Information and Telecommunications.
Rights and permissions
About this article
Cite this article
Ishibashi, H., Furukawa, T. Hierarchical Tensor SOM Network for Multilevel–Multigroup Analysis. Neural Process Lett 47, 1011–1025 (2018). https://doi.org/10.1007/s11063-017-9643-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-017-9643-1