Abstract
Data visualization is a core approach for understanding data specifics and extracting useful information in a simple and intuitive way. Visual data mining proceeds by projecting multidimensional data onto two-dimensional (2D) or three-dimensional (3D) data, e.g., through mathematical optimization and topology preserved in multidimensional scaling (MDS). However, this projection does not necessarily comply with the user’s needs, prior knowledge and/or expectations. This paper proposes an interactive visual mining approach, centered on the user’s needs and allowing the modification of data visualization by leveraging approaches from metric learning. The paper exemplifies the proposed system, referred to as Interactive Metric Learning-based Visual Data Exploration (IMViDE), applied to scientific social network browsing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Note that the classification accuracy maximization can also be tackled by feature selection, that is, a combinatorial optimization problem.
- 2.
- 3.
The total differs from the sum of the categories as each article may have more than one subject category.
References
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: Advances in Knowledge Discovery and Data Mining, pp. 1–34. American Association for Artificial Intelligence, Menlo Park (1996)
Keim, D.: Information visualization and visual data mining. IEEE Trans. Visual Comput. Graphics 8(1), 1–8 (2002)
Buja, A., Swayne, D.F., Littman, M.L., Dean, N., Hofmann, H., Chen, L.: Data visualization with multidimensional scaling. J. Comput. Graph. Stat. 17(2), 444–472 (2008)
Broekens, J., Cocx, T.: Object-centered interactive multi-dimensional scaling: ask the expert. In: Proceedings of the Eighteenth Belgium-Netherlands Conference on Artificial Intelligence (BNAIC 2006), pp. 59–66 (2006)
Brown, E.T., Liu, J., Brodley, C.E., Chang, R.: Dis-function: learning distance functions interactively. In: IEEE Conference on on Visual Analytics Science and Technology (VAST 2012), pp. 83–92. IEEE (2012)
Weinberger, K.Q., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 10, 207–244 (2009)
Jolliffe, I.T.: Principal Component Analysis. Springer, New York (2002)
Kohonen, T.: Self-organizing Maps. Springer, Heidelberg (2001)
Bishop, C.M., Svensén, M., Williams, C.K.I.: GTM: the generative topographic mapping. Neural Comput. 10(1), 215–234 (1998)
Van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(2579–2605), 85 (2008)
Jeong, D.H., Ziemkiewicz, C., Fisher, B.D., Ribarsky, W., Chang, R.: iPCA: an interactive system for PCA-based visual analytics. Comput. Graph. Forum 28(3), 767–774 (2009)
Buja, A., Swayne, D.F., Littman, M.L., Dean, N., Hofmann, H., Chen, L.: Data visualization with multidimensional scaling. J. Comput. Graph. Stat. 17(2), 444–472 (2008). doi:10.1198/106186008X318440
Kim, H., Choo, J., Park, H., Endert, A.: Interaxis: steering scatterplot axes via observation-level interaction. IEEE Trans. Vis. Comput. Graph. 22(1), 131–140 (2016)
Yi, J.S., Melton, R., Stasko, J.T., Jacko, J.A.: Dust & magnet: multivariate information visualization using a magnet metaphor. Inf. Visual. 4(3), 239–256 (2005)
Choo, J., Lee, H., Kihm, J., Park, H.: iVisClassifier: an interactive visual analytics system for classification based on supervised dimension reduction. In: Proceedings of the IEEE Conference on Visual Analytics Science and Technology, IEEE VAST 2010, Salt Lake City, Utah, USA, 24–29 October 2010, part of VisWeek 2010, pp. 27–34 (2010)
Bellet, A., Habrard, A., Sebban, M.: A survey on metric learning for feature vectors and structured data. CoRR abs/1306.6709 (2013)
Goldberger, J., Hinton, G.E., Roweis, S.T., Salakhutdinov, R.: Neighbourhood components analysis. Adv. Neural Inf. Process. Syst. 17, 513–520 (2004)
Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: Proceedings of the 24th International Conference on Machine Learning, pp. 209–216. ACM (2007)
Qi, G.J., Tang, J., Zha, Z.J., Chua, T.S., Zhang, H.J.: An efficient sparse metric learning in high-dimensional space via l1-penalized log-determinant regularization. In: Proceedings of the 26th Annual International Conference on Machine Learning, ICML 2009, pp. 841–848. ACM, New York (2009)
Leman, S.C., House, L.L., Maiti, D., Endert, A., North, C.: Visual to parametric interaction (v2pi). PloS One 8(3), e50474 (2013)
Joia, P., Coimbra, D.B., Cuminato, J.A., Paulovich, F.V., Nonato, L.G.: Local affine multidimensional projection. IEEE Trans. Vis. Comput. Graph. 17(12), 2563–2571 (2011)
Mizuno, K., Wu, H., Takahashi, S.: Manipulating bilevel feature space for category-aware image exploration. In: IEEE Pacific Visualization Symposium, PacificVis 2014, Yokohama, Japan, 4–7 March 2014, pp. 217–224 (2014)
Hu, X., Bradel, L., Maiti, D., House, L., North, C., Leman, S.: Semantics of directly manipulating spatializations. IEEE Trans. Vis. Comput. Graph. 19(12), 2052–2059 (2013)
Takano, A., Niwa, Y., Nishioka, S., Hisamitsu, T., Iwayama, M., Imaichi, O.: Associative information access using DualNavI. In: Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, pp. 771–772 (2001)
Cutting, D.R., Pedersen, J.O., Karger, D., Tukey, J.W.: Scatter/gather: a cluster-based approach to browsing large document collections. In: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 318–329 (1992)
Gong, X., Ke, W., Khare, R.: Studying scatter/gather browsing for web search. Proc. Am. Soc. Inf. Sci. Technol. 49(1), 1–4 (2012)
Zhang, Y., Broussard, R., Ke, W., Gong, X.: Evaluation of a scatter/gather interface for supporting distinct health information search tasks. J. Assoc. Inf. Sci. Technol. 65(5), 1028–1041 (2014)
Leeuw, J.D., Mair, P.: Multidimensional scaling using majorization: SMACOF in R. J. Stat. Softw. 31(3), 30 (2009)
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1999, pp. 50–57. ACM, New York (1999)
Wu, K., Zheng, Z.: Fast lmnn algorithm through random sampling. In: IEEE International Conference on Data Mining Workshop (ICDMW), November 2015, pp. 871–876 (2015)
Acknowledgement
We would like to thank Prof. Jean-Daniel Fekete for many suggestions and discussion about this work. The first author was partially supported by JSPS KAKENHI Grant Number 25280035.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Yoshioka, M., Itoh, M., Sebag, M. (2016). Interactive Metric Learning-Based Visual Data Exploration: Application to the Visualization of a Scientific Social Network. In: Grant, E., Kotzinos, D., Laurent, D., Spyratos, N., Tanaka, Y. (eds) Information Search, Integration, and Personalization. ISIP 2015. Communications in Computer and Information Science, vol 622. Springer, Cham. https://doi.org/10.1007/978-3-319-43862-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-43862-7_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43861-0
Online ISBN: 978-3-319-43862-7
eBook Packages: Computer ScienceComputer Science (R0)