Skip to main content

Interactive Metric Learning-Based Visual Data Exploration: Application to the Visualization of a Scientific Social Network

  • Conference paper
  • First Online:
Information Search, Integration, and Personalization (ISIP 2015)

Abstract

Data visualization is a core approach for understanding data specifics and extracting useful information in a simple and intuitive way. Visual data mining proceeds by projecting multidimensional data onto two-dimensional (2D) or three-dimensional (3D) data, e.g., through mathematical optimization and topology preserved in multidimensional scaling (MDS). However, this projection does not necessarily comply with the user’s needs, prior knowledge and/or expectations. This paper proposes an interactive visual mining approach, centered on the user’s needs and allowing the modification of data visualization by leveraging approaches from metric learning. The paper exemplifies the proposed system, referred to as Interactive Metric Learning-based Visual Data Exploration (IMViDE), applied to scientific social network browsing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Note that the classification accuracy maximization can also be tackled by feature selection, that is, a combinatorial optimization problem.

  2. 2.

    http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/.

  3. 3.

    The total differs from the sum of the categories as each article may have more than one subject category.

References

  1. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: Advances in Knowledge Discovery and Data Mining, pp. 1–34. American Association for Artificial Intelligence, Menlo Park (1996)

    Google Scholar 

  2. Keim, D.: Information visualization and visual data mining. IEEE Trans. Visual Comput. Graphics 8(1), 1–8 (2002)

    Article  MathSciNet  Google Scholar 

  3. Buja, A., Swayne, D.F., Littman, M.L., Dean, N., Hofmann, H., Chen, L.: Data visualization with multidimensional scaling. J. Comput. Graph. Stat. 17(2), 444–472 (2008)

    Article  MathSciNet  Google Scholar 

  4. Broekens, J., Cocx, T.: Object-centered interactive multi-dimensional scaling: ask the expert. In: Proceedings of the Eighteenth Belgium-Netherlands Conference on Artificial Intelligence (BNAIC 2006), pp. 59–66 (2006)

    Google Scholar 

  5. Brown, E.T., Liu, J., Brodley, C.E., Chang, R.: Dis-function: learning distance functions interactively. In: IEEE Conference on on Visual Analytics Science and Technology (VAST 2012), pp. 83–92. IEEE (2012)

    Google Scholar 

  6. Weinberger, K.Q., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 10, 207–244 (2009)

    MATH  Google Scholar 

  7. Jolliffe, I.T.: Principal Component Analysis. Springer, New York (2002)

    MATH  Google Scholar 

  8. Kohonen, T.: Self-organizing Maps. Springer, Heidelberg (2001)

    Book  MATH  Google Scholar 

  9. Bishop, C.M., Svensén, M., Williams, C.K.I.: GTM: the generative topographic mapping. Neural Comput. 10(1), 215–234 (1998)

    Article  MATH  Google Scholar 

  10. Van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(2579–2605), 85 (2008)

    MATH  Google Scholar 

  11. Jeong, D.H., Ziemkiewicz, C., Fisher, B.D., Ribarsky, W., Chang, R.: iPCA: an interactive system for PCA-based visual analytics. Comput. Graph. Forum 28(3), 767–774 (2009)

    Article  Google Scholar 

  12. Buja, A., Swayne, D.F., Littman, M.L., Dean, N., Hofmann, H., Chen, L.: Data visualization with multidimensional scaling. J. Comput. Graph. Stat. 17(2), 444–472 (2008). doi:10.1198/106186008X318440

    Article  MathSciNet  Google Scholar 

  13. Kim, H., Choo, J., Park, H., Endert, A.: Interaxis: steering scatterplot axes via observation-level interaction. IEEE Trans. Vis. Comput. Graph. 22(1), 131–140 (2016)

    Article  Google Scholar 

  14. Yi, J.S., Melton, R., Stasko, J.T., Jacko, J.A.: Dust & magnet: multivariate information visualization using a magnet metaphor. Inf. Visual. 4(3), 239–256 (2005)

    Google Scholar 

  15. Choo, J., Lee, H., Kihm, J., Park, H.: iVisClassifier: an interactive visual analytics system for classification based on supervised dimension reduction. In: Proceedings of the IEEE Conference on Visual Analytics Science and Technology, IEEE VAST 2010, Salt Lake City, Utah, USA, 24–29 October 2010, part of VisWeek 2010, pp. 27–34 (2010)

    Google Scholar 

  16. Bellet, A., Habrard, A., Sebban, M.: A survey on metric learning for feature vectors and structured data. CoRR abs/1306.6709 (2013)

    Google Scholar 

  17. Goldberger, J., Hinton, G.E., Roweis, S.T., Salakhutdinov, R.: Neighbourhood components analysis. Adv. Neural Inf. Process. Syst. 17, 513–520 (2004)

    Google Scholar 

  18. Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: Proceedings of the 24th International Conference on Machine Learning, pp. 209–216. ACM (2007)

    Google Scholar 

  19. Qi, G.J., Tang, J., Zha, Z.J., Chua, T.S., Zhang, H.J.: An efficient sparse metric learning in high-dimensional space via l1-penalized log-determinant regularization. In: Proceedings of the 26th Annual International Conference on Machine Learning, ICML 2009, pp. 841–848. ACM, New York (2009)

    Google Scholar 

  20. Leman, S.C., House, L.L., Maiti, D., Endert, A., North, C.: Visual to parametric interaction (v2pi). PloS One 8(3), e50474 (2013)

    Article  Google Scholar 

  21. Joia, P., Coimbra, D.B., Cuminato, J.A., Paulovich, F.V., Nonato, L.G.: Local affine multidimensional projection. IEEE Trans. Vis. Comput. Graph. 17(12), 2563–2571 (2011)

    Article  Google Scholar 

  22. Mizuno, K., Wu, H., Takahashi, S.: Manipulating bilevel feature space for category-aware image exploration. In: IEEE Pacific Visualization Symposium, PacificVis 2014, Yokohama, Japan, 4–7 March 2014, pp. 217–224 (2014)

    Google Scholar 

  23. Hu, X., Bradel, L., Maiti, D., House, L., North, C., Leman, S.: Semantics of directly manipulating spatializations. IEEE Trans. Vis. Comput. Graph. 19(12), 2052–2059 (2013)

    Article  Google Scholar 

  24. Takano, A., Niwa, Y., Nishioka, S., Hisamitsu, T., Iwayama, M., Imaichi, O.: Associative information access using DualNavI. In: Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, pp. 771–772 (2001)

    Google Scholar 

  25. Cutting, D.R., Pedersen, J.O., Karger, D., Tukey, J.W.: Scatter/gather: a cluster-based approach to browsing large document collections. In: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 318–329 (1992)

    Google Scholar 

  26. Gong, X., Ke, W., Khare, R.: Studying scatter/gather browsing for web search. Proc. Am. Soc. Inf. Sci. Technol. 49(1), 1–4 (2012)

    Article  Google Scholar 

  27. Zhang, Y., Broussard, R., Ke, W., Gong, X.: Evaluation of a scatter/gather interface for supporting distinct health information search tasks. J. Assoc. Inf. Sci. Technol. 65(5), 1028–1041 (2014)

    Article  Google Scholar 

  28. Leeuw, J.D., Mair, P.: Multidimensional scaling using majorization: SMACOF in R. J. Stat. Softw. 31(3), 30 (2009)

    Article  Google Scholar 

  29. Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1999, pp. 50–57. ACM, New York (1999)

    Google Scholar 

  30. Wu, K., Zheng, Z.: Fast lmnn algorithm through random sampling. In: IEEE International Conference on Data Mining Workshop (ICDMW), November 2015, pp. 871–876 (2015)

    Google Scholar 

Download references

Acknowledgement

We would like to thank Prof. Jean-Daniel Fekete for many suggestions and discussion about this work. The first author was partially supported by JSPS KAKENHI Grant Number 25280035.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Masaharu Yoshioka .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Yoshioka, M., Itoh, M., Sebag, M. (2016). Interactive Metric Learning-Based Visual Data Exploration: Application to the Visualization of a Scientific Social Network. In: Grant, E., Kotzinos, D., Laurent, D., Spyratos, N., Tanaka, Y. (eds) Information Search, Integration, and Personalization. ISIP 2015. Communications in Computer and Information Science, vol 622. Springer, Cham. https://doi.org/10.1007/978-3-319-43862-7_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-43862-7_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-43861-0

  • Online ISBN: 978-3-319-43862-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics