Skip to main content

Text Visualization for Visual Text Analytics

  • Chapter

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4404))

Abstract

The termvisual text analytics describes a class of information analysis techniques and processes that enable knowledge discovery via the use of interactive graphical representations of textual data. These techniques enable discovery and understanding via the recruitment of human visual pattern recognition and spatial reasoning capabilities. Visual text analytics is a subclass of visual data mining / visual analytics, which more generally encompasses analytical techniques that employ visualization of non-physically-based (or “abstract”) data of all types. Text visualization is a key component in visual text analytics. While the term “text visualization” has been used to describe a variety of methods for visualizing both structured and unstructured characteristics of text-based data, it is most closely associated with techniques for depicting the semantic characteristics of the free-text components of documents in large document collections. In contrast with text clustering techniques which serve only to partition text corpora into sets of related items, these so-called semantic mapping methods also typically strive to depict detailed inter- and intra-set similarity structure. Text analytics software typically couples semantic mapping techniques with additional visualization techniques to enable interactive comparison of semantic structure with other characteristics of the information, such as publication date or citation information. In this way, value can be derived from the material in the form of multidimensional relationship patterns existing among the discrete items in the collection. The ultimate goal of these techniques is to enable human understanding and reasoning about the contents of large and complexly related text collections.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284(5), 34–43 (2001)

    Article  Google Scholar 

  2. Feldman, R., Dagan, I.: KDT - Knowledge discovery in text. In: Proceedings of the First International Conference on Knowledge Discovery (KDD 1995), Montreal (1995)

    Google Scholar 

  3. Thomas, J.J., Cook, K.A. (eds.): Illuminating the Path: The Research and Development Agenda for Visual Analytics. IEEE Computer Society Press, Los Alamitos (2005)

    Google Scholar 

  4. Eick, S., et al.: WEB-Based Text Visualization. 1997, National Institute of Statistical Sciences: Technical Report 64. p. http://www.niss.org/technicalreports/tr64.pdf

  5. Rohrer, R.M., Ebert, D.S., Sibert, J.L.: The shape of Shakespeare: Visualizing text using implicit surfaces. In: Proceedings of the IEEE Symposium on Information Visualization 1998, North Carolina (1998)

    Google Scholar 

  6. Miller, N.E., et al.: Topic Islands: A wavelet-based text visualization system. In: Proceedings of the 9th IEEE Conference on Visualization VIS 1998, Research Triangle Park, North Carolina (1998)

    Google Scholar 

  7. Lin, X., Soergel, D., Marchionini, D.: A self-organizing semantic map for information retrieval. In: Proceedings of the Fourteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Chicago (1991)

    Google Scholar 

  8. Chalmers, M., Chitson, P.: Bead: Explorations in information visualization. In: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Copenhagen (1992)

    Google Scholar 

  9. Wise, J.A., et al.: Visualizing the non-visual: Spatial analysis and interaction with information from text documents. In: Proceedings of the IEEE Information Visualization Symposium InfoVis 1995, Atlanta, Georgia (1995)

    Google Scholar 

  10. Kohonen, T., et al.: Self organization of a massive document collection. IEEE Transactions on Neural Networks 11(3), 574–585 (2000)

    Article  Google Scholar 

  11. Doyle, L.: Semantic roadmaps for literature searchers. Journal of the Association for Computing Machinery 8(4), 367–391 (1961)

    MathSciNet  Google Scholar 

  12. Sammon, J.W.: A nonlinear mapping for data structure analysis. IEEE Transactions on Computing 18(5), 401–409 (1969)

    Article  Google Scholar 

  13. Crouch, D.: The visual display of information in an information retrieval environment. In: Proceedings of the 9th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Pisa (1986)

    Google Scholar 

  14. York, J., et al.: Clustering and dimensionality reduction in SPIRE. In: Proceedings of the Symposium on Advanced Information Processing and Analysis, AIPA 1995, Tysons Corner, VA (1995)

    Google Scholar 

  15. Salton, G.: Automatic Text Processing. Addison-Wesley, Reading (1989)

    Google Scholar 

  16. Wall, M.E., Rechtsteiner, A., Rocha, L.M.: Singular value decomposition and principal component analysis. In: Berrar, D.P., Dubitzky, W., Granzow, M. (eds.) A Practical Approach to Microarray Data Analysis, pp. 91–109. Kluwer, Norwell (2003)

    Chapter  Google Scholar 

  17. Deerwester, S.C., et al.: Indexing by latent semantic analysis. Journal of American Society for Information Science 41(6), 391–407 (1990)

    Article  Google Scholar 

  18. Booker, A., et al.: Visualizing text data sets. Computing in Science & Engineering 1(4), 26–35 (1999)

    Article  Google Scholar 

  19. Huang, S., Ward, M., Rundensteiner, E.: Exploration of dimensionality reduction for text visualization. Worcester Polytechnic Institute, Computer Science Department: Technical Report TR-03-14, Worcester (2003)

    Google Scholar 

  20. Chatfield, C., Collins, A.: Introduction to Multivariate Analysis. Chapman & Hall, London (1980)

    MATH  Google Scholar 

  21. Kruskal, J.: Multidimensional scaling by optimizing goodness-of-fit to a nonmetric hypothesis. Psychometrika 29, 1–27 (1964); Reprinted in Key Texts in Multidimensional Scaling, In: Davies, P.M., Coxon, A.P.M.(eds.). Heinemann Educational Books, Exeter, N.H., pp. 59–83 (1982)

    Google Scholar 

  22. Risch, J.S., et al.: The STARLIGHT information visualization system. In: Card, S., Mackinlay, J., Shneiderman, B. (eds.) Readings in Information Visualization: Using Vision To Think, pp. 551–560. Morgan Kaufmann, San Francisco (1999)

    Google Scholar 

  23. Hartigan, J.A.W., A., M.: Algorithm AS136: A k-Means Clustering Algorithm. Applied Statistics 28, 100–108 (1979)

    Article  MATH  Google Scholar 

  24. Bertin, J.: Semiology of Graphics: Diagrams, Networks, Maps. University of Wisconsin Press, Madison (1983)

    Google Scholar 

  25. Lagus, K., Kaski, S., Kohonen, T.: Mining massive document collections by the WEBSOM method. Information Sciences 163(1-3), 135–156 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Simeon J. Simoff Michael H. Böhlen Arturas Mazeika

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Risch, J., Kao, A., Poteet, S.R., Wu, Y.J.J. (2008). Text Visualization for Visual Text Analytics. In: Simoff, S.J., Böhlen, M.H., Mazeika, A. (eds) Visual Data Mining. Lecture Notes in Computer Science, vol 4404. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71080-6_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71080-6_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71079-0

  • Online ISBN: 978-3-540-71080-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics