Towards a Visual Data Language to Improve Insights into Complex Multidimensional Data
Data volume is increasing steadily. Visualization helps to handle not only the volume but also the ever-increasing diversity of data. Visualization gives answers faster and reveals information that would otherwise go unnoticed and therefore unused in decision making. The challenge we address in this contribution is how visualizations can be created semi-automatically without taking the designer's individual, human-centered view of an interface out of the loop. In this paper, we present a tool-supported design process to develop aesthetic and interactive data visualizations in a conceptual, guided, and effective way.
Keywords: Information design, Process model, Information visualization, Aesthetics, Tool-support design
1 Introduction
Every information visualization has one very basic need: the narration of a story. Over time, new ways to visualize information have been developed. Today, almost everyone is familiar with basic chart types such as the line chart or the pie chart. Charts are used to present large amounts of data in a more understandable way than spreadsheets or textual reports. Current visual interfaces that deal with big data typically show only different charts and graph visualizations arranged in dashboards. Hence, the user faces an increased cognitive load when merging the visualized datasets in order to gain insights into complex information. Visualizations that combine multidimensional data can provide answers faster than dashboards and reveal information that would go unnoticed and therefore unused in common chart visualizations. Yet, the design of suitable visual representations of complex and multidimensional datasets requires expert knowledge. The information designer has to employ the right design principles to compose a meaningful story that clarifies the complexity of datasets, explains a process, highlights a trend, or supports a specific argument. As a consequence, a complex interactive visualization requires months of work by highly skilled professionals. On the other hand, data visualization should be a quick and easy way to convey information. It should be remembered, however, that poorly designed data representations can distort the intended message, lose the user’s attention, or fail to guide them toward meaningful conclusions.
In this paper, we present a novel tool-supported approach for dealing with complex datasets apart from the well-established charting perspective. The addressed challenge is to reduce development time for information visualization by providing tools of the trade to realize information design in a conceptual, guided, and effective way. Common libraries, frameworks, and tools used to create visual interfaces currently lack an overall information design process that provides human-centered access to multidimensional datasets. Hence, designers and developers are constrained to a manual design process.
The aim is to enhance the design process with guidance based on visualization tools. To this end, we present the ViDaLa approach, which makes it possible to create visually appealing information visualizations without the expert knowledge of software engineers and information designers.
2 Related Work
Information visualization is the representation of data in graphical form. The concept of using images to understand data has been around for centuries, from maps and graphs in the 17th century to the invention of the pie chart in the early 1800s. Today, information visualization has become a rapidly evolving blend of business, science, and art that is defined as “visual representations of the semantics, or meaning, of information. […] information visualization typically deals with nonnumeric, nonspatial, and high-dimensional data.” Designing information visualization can be described as the practice of presenting information in a way that fosters efficient and effective understanding of the underlying dataset.
There is a vast variety of methods and approaches in information design, such as visualization patterns, interaction patterns, and design guidelines in visualization and in interaction design. Lau and Vande Moere present a model that focuses on aesthetics as a conceptual influence on the technical implementation of visualizations. It identifies information aesthetics as the conceptual link between information visualization and visualization art. Lang discusses the importance of aesthetics in visualizations in relation to their efficiency, whereas Kosara proposes a classification of several types of information visualization based on aesthetic criteria. Based on aesthetic criteria as well as patterns and guidelines, Fry describes the process of creating a data visualization in a very accessible way and introduces a tool that simplifies the computational process for beginners. Munzner defined a layer-based workflow for the design and validation of visualizations.
Alongside conceptual and theoretical approaches, different kinds of tools and visualization libraries are available today. Visualization grammars such as Vega provide a declarative format for creating and saving interactive visualizations. Graph libraries, for instance chartist-js or chart.js, offer a set of tools to display and lay out interactive graphs. Business intelligence software is used to create interactive analysis dashboards from any data source and to publish them; it offers high performance for large data sources, wide device support, support for multiple sources, and easy-to-use publishing functions.
In conclusion, there are many expert tools and libraries as well as specific methods available that are used to train professionals in the field of information design. Without such training and tool knowledge, it is almost impossible to design and create sophisticated information visualizations of multidimensional data.
3.1 Preliminary Considerations
As shown in the related work section, several models of the visualization process have outlined that there is no single way to create visualizations. There are many different paths through the process, and most of them can produce useful results. The most remarkable characteristic of the process is that it is always explorative and iterative. The ViDaLa Process Model is intended to serve as a process template based on the conceptual layers of Munzner's workflow for creating visualizations. The concept consists of four nested layers: characterize the task and data in the vocabulary of the problem domain, abstract into operations and data types, design visual encoding and interaction techniques, and create algorithms to execute techniques efficiently. Our process model enriches this model, except for the fourth layer, with concrete methods and tools that help users select appropriate design steps based on the intended usage. The following subsections describe these steps of the ViDaLa approach in detail.
3.2 Problem Characterization and Data Gathering
The first step of the ViDaLa process starts with a period of discovery. Preliminary questions should be asked in order to develop an adequate strategy for the information visualization and to create a design that suits the needs. Only when domain challenges, user needs, and stakeholder requirements are analyzed during the early stage of the information design process can the whole development be completed successfully and result in information designs that fulfil complex, interwoven requirements.
Identify the quantity of data and datasets
Find out the existing file formats and data sources and whether data need to be scraped
Discover the complexity of the dataset
Analyze the dataset with regard to numerical range and dimensionality
Find out if the structure of the dataset is directly usable or if it needs some restructuring
Identify the quality of the data
Identify the target audience
Find out the type of content
Discover time and place for the information visualization
Analyze reasons for information need
Identify the manner of use of the information
To discover the necessary requirements and answer these initial questions, different kinds of mind mapping tools as well as text editors can be used to quickly add records and notes to any part of the problem characterization and data gathering step and to reorganize aspects on the fly.
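The data-related questions above (quantity, numerical range, dimensionality, and quality) can be partly answered by a small profiling script. The following is a minimal sketch and not part of the ViDaLa tooling; the function name `profile_dataset` and the sample records are hypothetical and serve only as an illustration.

```python
from typing import Any


def profile_dataset(records: list[dict[str, Any]]) -> dict[str, Any]:
    """Summarize quantity, dimensionality, numeric ranges, and missing values."""
    # Every key that occurs in at least one record counts as a dimension.
    fields = sorted({key for record in records for key in record})
    ranges: dict[str, tuple[float, float]] = {}
    missing: dict[str, int] = {}
    for field in fields:
        values = [record.get(field) for record in records]
        # Absent or empty entries serve as a rough data-quality indicator.
        missing[field] = sum(1 for v in values if v is None or v == "")
        numeric = [
            v for v in values
            if isinstance(v, (int, float)) and not isinstance(v, bool)
        ]
        if numeric:
            ranges[field] = (min(numeric), max(numeric))
    return {
        "record_count": len(records),
        "dimensionality": len(fields),
        "numeric_ranges": ranges,
        "missing_values": missing,
    }


# Hypothetical sample data, for illustration only.
sample = [
    {"age": 54, "ward": "A", "heart_rate": 72},
    {"age": 61, "ward": "", "heart_rate": 88},
    {"age": 47, "ward": "B"},
]
profile = profile_dataset(sample)
```

Such a summary does not replace the discussion with stakeholders, but it gives a quick indication of whether the dataset needs restructuring or cleaning before design work starts.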
3.3 Finding the Narrative
How does the designer categorize and structure information?
How does the designer represent information?
How does the user interact with information?
How does the user search information and on which level of detail?
In order to create the story of the visualization, it is necessary to understand the correlations between users, content, and context. To address this problem, an exploration tool for existing information designs was developed that illustrates characteristics of visualizations as well as their usage context. The software tool DelViz (Deep Exploration and Lookup of Visualizations) supports searching for information visualizations from various points of view. The application allows search and analysis in a collection of information visualizations. The dataset used by DelViz currently contains 700 visualization projects, which are characterized and stored with title, description, preview picture, and a link to a demo of the information visualization or to a related website. DelViz is intended to support different search tasks: finding suitable information visualizations for a given context, and analyzing the underlying structured dataset to discover relationships between the visualizations. Using different search strategies, suitable visualizations and forms of depiction for the story can be explored (Fig. 2).
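The two search tasks described above can be illustrated with a small tag-based lookup. This is a hedged sketch only: it does not reproduce the DelViz implementation or its data model, and the class `VisualizationEntry`, the tag vocabulary, and the example entries are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class VisualizationEntry:
    """One catalogued visualization project (hypothetical schema)."""
    title: str
    description: str
    link: str
    tags: set[str] = field(default_factory=set)


def find_by_tags(entries, required):
    """Task 1: return the entries that carry every required tag."""
    wanted = set(required)
    return [entry for entry in entries if wanted <= entry.tags]


def related(entries, target):
    """Task 2: rank the other entries by the number of tags shared with target."""
    scored = [
        (len(target.tags & entry.tags), entry)
        for entry in entries
        if entry is not target
    ]
    scored = [(score, entry) for score, entry in scored if score > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [entry for _, entry in scored]


# Invented example entries; not taken from the DelViz collection.
collection = [
    VisualizationEntry("Flight Routes", "Air traffic as a network",
                       "https://example.org/flights",
                       {"network", "geospatial", "zoom"}),
    VisualizationEntry("Budget Tree", "Public spending hierarchy",
                       "https://example.org/budget",
                       {"hierarchy", "treemap", "zoom"}),
    VisualizationEntry("Citation Web", "Co-authorship graph",
                       "https://example.org/citations",
                       {"network", "filter"}),
]
network_examples = find_by_tags(collection, {"network"})
similar_to_first = related(collection, collection[0])
```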
3.4 Determining a Visual Approach
After creating the storyboard with the aid of DelViz, the concrete data has to be turned into visuals. Creating a complex information design means evolving a visualization. For this purpose, visual representations for the dataset have to be discovered and determined to tell the given story. In this step of the ViDaLa process, there is an almost one-to-one relationship between data and visual representation.
Ware terms the basic building blocks of visualization “Preattentive Attributes”. These attributes come into play when we determine the visual representation of data. For instance, position and length can be used to perceive quantitative data with precision. Other attributes are useful for perceiving other types of data, such as categorical or relational data. The chosen representation should be the simplest possible form that conveys the most relevant aspects of the dataset. In any case, this decision is guided by the data, which will lend itself to one visual form or a combination of several visual forms. In this design step, many intermediate visualizations have to be designed to arrive at the final result. In every iteration, it has to be evaluated whether the selected basic building blocks of the visualization work for the information design. It is hard to sketch out shapes or forms without using real data. Hence, it is very important to walk through this ViDaLa step with real data. To address this problem, a software tool is used that is based on the concept of glyphs, a common form in which a dataset is depicted by a collection of visual representations. Different properties of the data are depicted as different visual variables of a glyph. To address the challenge of determining the suitability of various visualizations for a given dataset, the tool GlyphViz was developed.
With GlyphViz, different forms can be examined through fast and flexible creation and adaptation of glyphs (Fig. 3). Additionally, the tool provides sorting functions based on similarity to pre-selected data. This interactive tool supports decision making by depicting different representations of the data. These decisions should be well-founded in terms of design and aesthetics, but also based on the dataset and the story of the visualization.
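The idea of mapping data properties to visual variables of a glyph, and of sorting by similarity to a pre-selected record, can be sketched as follows. This is an illustrative assumption and not the GlyphViz implementation: the visual variables in `VISUAL_RANGES`, the linear scaling, and the Euclidean distance on normalized values are all choices made for this example.

```python
import math

# Output ranges of the visual variables (assumed values for illustration).
VISUAL_RANGES = {"radius": (4.0, 20.0), "hue": (0.0, 240.0)}


def normalize(value, low, high):
    """Scale a value linearly to [0, 1]; a degenerate range maps to 0."""
    return 0.0 if high == low else (value - low) / (high - low)


def build_glyph(record, mapping, data_ranges):
    """Map each data attribute to its assigned visual variable."""
    glyph = {}
    for attribute, variable in mapping.items():
        t = normalize(record[attribute], *data_ranges[attribute])
        out_low, out_high = VISUAL_RANGES[variable]
        glyph[variable] = out_low + t * (out_high - out_low)
    return glyph


def sort_by_similarity(records, reference, attributes, data_ranges):
    """Order records by Euclidean distance to a pre-selected reference record."""
    def distance(record):
        return math.sqrt(sum(
            (normalize(record[a], *data_ranges[a])
             - normalize(reference[a], *data_ranges[a])) ** 2
            for a in attributes
        ))
    return sorted(records, key=distance)


# Hypothetical records and ranges.
records = [
    {"id": "a", "price": 10, "weight": 2},
    {"id": "b", "price": 30, "weight": 6},
    {"id": "c", "price": 20, "weight": 3},
]
data_ranges = {"price": (10, 30), "weight": (2, 6)}
mapping = {"price": "radius", "weight": "hue"}

glyphs = [build_glyph(r, mapping, data_ranges) for r in records]
ordered = sort_by_similarity(records, records[0], ["price", "weight"], data_ranges)
```

Swapping entries in `mapping` is enough to try a different encoding with the same real data, which mirrors the iterative evaluation described in this step.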
3.5 Creating an Interaction Graph
Interactive data visualizations enable users to focus on interesting parts and details, to customize the content and even the graphical form, and to explore large amounts of data. Therefore, the ability to change the presentation of a dataset fundamentally extends visualization capabilities. Interaction methods involve either how the data interacts with itself or how users can interact with and control the data representation. In software, it is easy to provide many options to the user. Yet, it is more difficult, and more important, to work out efficient and effective interactions that are relevant to the majority of the tasks executed by the majority of users. For instance, highlighting elements and showing details on demand are interactions that are useful for almost all data visualizations. Furthermore, assembling multiple representations in multiple views and coordinating them clarifies different aspects of the dataset at the same time. These are only some tangible quality standards for interactions in visualization as described in [4, 18, 19]. The benefit of such a configurable visualization is that it can display different, structurally similar datasets that the user can change interactively. Additionally, when interactions are used in such a manner, an interactive visualization can make a much larger dataset accessible than a comparable static graphic.
A software tool to support this ViDaLa step is still under development. It focuses on user-centered interaction within views, on linking views, and on the easy implementation of essential transitions and animations. Intended interactions will follow the guidelines of Shneiderman, the taxonomy of tools that support visualizations, and the seven general categories of interaction techniques described by Yi et al. Commercial tools such as Spotfire and Tableau provide examples of such interactions by specifying visualizations through drag-and-drop operations.
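Highlighting, details on demand, and the coordination of multiple views can be sketched with a shared selection model that every view observes. The classes `SelectionModel` and `View` below are hypothetical and do not describe the tool under development; they only illustrate one common way to link views.

```python
class SelectionModel:
    """Shared selection that notifies every subscribed view."""

    def __init__(self):
        self._listeners = []

    def subscribe(self, listener):
        self._listeners.append(listener)

    def select(self, ids):
        selected = frozenset(ids)
        for listener in self._listeners:
            listener(selected)


class View:
    """One representation of (part of) the dataset, keyed by record id."""

    def __init__(self, name, records, selection):
        self.name = name
        self.records = records
        self.highlighted = set()
        self._selection = selection
        selection.subscribe(self._on_selection)

    def _on_selection(self, ids):
        # Highlight only those selected records that this view displays.
        self.highlighted = set(ids) & set(self.records)

    def brush(self, predicate):
        """Select all records in this view that satisfy the predicate."""
        self._selection.select(
            rid for rid, record in self.records.items() if predicate(record)
        )

    def details(self, record_id):
        """Details on demand for a single record."""
        return self.records[record_id]


# Hypothetical data shown in two coordinated views.
data = {1: {"value": 5}, 2: {"value": 15}, 3: {"value": 25}}
selection = SelectionModel()
scatter = View("scatter", data, selection)
table = View("table", {2: data[2], 3: data[3]}, selection)

scatter.brush(lambda record: record["value"] > 10)
```

After the brush in the scatter view, both views highlight records 2 and 3, although the selection was made in only one of them.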
3.6 Composition and Assembling
4 Case Study
To evaluate the ViDaLa design process, digital medical records of patients in a hospital setting were identified as a complex and multidimensional dataset (Fig. 4). The digital patient record consists of a large number of isolated data entries that are provided by various health information systems and medical devices. In order to transfer the digital patient and decision process model into real clinical applications, the ViDaLa process was applied and the digital medical records of patients were visualized in a graph-based visualization. In the context of interactive representations of clinical processes and information management, a user-intended categorization of the information architecture was developed. It was decided to visualize digital patient records as a network visualization. A circle glyph combined with a pie chart is used to visualize patient data in a network graph in order to present dependencies, connections, and hierarchies.
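How a circle glyph with an embedded pie chart might be derived for one node of such a network can be sketched as follows. The entry categories, the node layout, and the function `pie_slices` are assumptions for illustration; the paper does not specify how the glyph in the case study is computed.

```python
def pie_slices(counts):
    """Convert category counts into (category, start_angle, end_angle) in degrees."""
    total = sum(counts.values())
    slices = []
    start = 0.0
    for category, count in counts.items():
        sweep = 360.0 * count / total if total else 0.0
        slices.append((category, start, start + sweep))
        start += sweep
    return slices


# Hypothetical node of the network graph: entry counts per category.
node = {
    "id": "record-1",
    "entries": {"lab": 2, "imaging": 1, "medication": 1},
    "linked_to": ["record-2"],
}
slices = pie_slices(node["entries"])
```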
5 Conclusion
In this paper, we illustrated the importance of visualizing information in business units as well as in research and development. The presented ViDaLa approach constitutes a design process for interactive visualizations using a process model that is based on the analysis of given datasets and of a given problem domain.
The presented tool chain is a consistent set of individual tools that are used to perform information design tasks in order to create an interactive visualization. In general, all of the ViDaLa tools are used, although not necessarily consecutively. Moreover, the Visual Toolbox contains a visualization library that consists of graphical elements that can be composed and arranged according to the goals of the visualization. The approach presented in this paper allows developers who are not specifically trained in design and information visualization to create visualizations that assist users in exploring data in order to gain insights and information. By applying the model-based ViDaLa process, development time can be reduced while retaining the customizability of complex visualizations.
ViDaLa aims to make complex and multidimensional data more focused and more understandable for users by means of visualization tools and visual analytics methods. The overall goal is to place a strong emphasis on quality over quantity, especially in the big data world. Visualizing data in an effective, creative way will provide more relevant, understandable information. ViDaLa will help to simplify the design process necessary to create visualizations that focus on the value for the end user.
6 Future Work
With the ViDaLa model as a tool-supported information design process, open research questions about the effective and efficient perception of information can be pursued with great ease. This semi-automatic design process offers possibilities to broaden the range of information design in the context of analytics and business intelligence. With the foundation of the current process model, the aim is to identify more criteria, methods, and techniques that can be integrated into the ViDaLa tools of the trade. Therefore, design and development of a concrete interface can be enhanced and the number of design iterations can be reduced. In addition, the software tools supporting the ViDaLa process model are subject to constant further development and maintenance. Ideally, the tools from specification to implementation form a toolchain that can be executed consecutively, by making the output of each tool or design step the input or starting environment for the next one.
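The envisioned toolchain, in which the output of each tool becomes the input of the next, corresponds to a simple function pipeline. The sketch below is an assumption about how such chaining could look; the step functions and the dictionary-based artifact are hypothetical placeholders and not existing ViDaLa tools.

```python
from functools import reduce
from typing import Any, Callable

Artifact = dict[str, Any]
Step = Callable[[Artifact], Artifact]


# Hypothetical placeholder steps; each one returns an enriched copy of its input.
def characterize(artifact: Artifact) -> Artifact:
    return {**artifact, "requirements": ["overview first", "details on demand"]}


def find_narrative(artifact: Artifact) -> Artifact:
    story = f"Explain {artifact['dataset']} to {artifact['audience']}"
    return {**artifact, "story": story}


def choose_visual_form(artifact: Artifact) -> Artifact:
    return {**artifact, "visual_form": "glyph network"}


def run_toolchain(steps: list[Step], initial: Artifact) -> Artifact:
    """Feed the output of each step into the next one."""
    return reduce(lambda artifact, step: step(artifact), steps, initial)


result = run_toolchain(
    [characterize, find_narrative, choose_visual_form],
    {"dataset": "patient records", "audience": "clinicians"},
)
```

Because every step only reads and extends the shared artifact, individual steps can be repeated or reordered, which fits the iterative character of the design process.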
References
- 2. Heer, J., Agrawala, M.: Software design patterns for information visualization. IEEE Trans. Vis. Comput. Graph. 12(5), 853–860 (2006). doi:http://doi.ieeecomputersociety.org/10.1109/TVCG.2006.178
- 4. Shneiderman, B.: The eyes have it: a task by data type taxonomy for information visualizations. In: Proceedings of IEEE Symposium on Visual Language, pp. 336–343. IEEE Computer Society Press (1996)
- 5. Borchers, J.O.: Interaction design patterns: twelve theses, pp. 1–6. Workshop Pattern Lang. Interact. Des. ACM Press, Hague, Netherlands (2000)
- 6. Lau, A., Vande Moere, A.: Towards a model of information aesthetics in information visualization. In: 11th International Conference on Information Visualization 2007 IV07, pp. 87–92. IEEE (2007)
- 7. Lang, A.: Aesthetics in information visualization. Trends Information Visualization, 8 (2009)
- 8. Kosara, R.: Visualization criticism-the missing link between information visualization and art. In: 11th International Conference on Information Visualization 2007 IV07, pp. 631–636. IEEE (2007)
- 9. Fry, B.J.: Computational information design. Massachusetts Institute of Technology (2004)
- 11. Satyanarayan, A., Wongsuphasawat, K., Heer, J.: Declarative interaction design for data visualization. In: Proceedings of 27th Annual ACM Symposium User Interface Software Technology, pp. 669–678. ACM (2014)
- 12. García, M., Harmsen, B.: Qlikview 11 for Developers. Packt Publishing Ltd, Birmingham (2012)
- 13. Fry, B.: Visualizing Data, 1st edn. O’Reilly Media, Sebastopol (2008)
- 14. Keck, M., Kammer, D., Iwan, R., et al.: DelViz: exploration of tagged information visualizations. In: Interaktion und Visualisierung im Daten-Web (2011)
- 15. Ware, C.: Information Visualization. Perception for Design, 2nd edn. Elsevier Ltd, Amsterdam (2004)
- 16. de Almeida Madeira Clemente, M., Keck, M., Groh, R.: TagStar: a glyph-based interface for indexing and visual analysis. In: Proceedings of 2014 International Working Conference on Advanced Visual Interfaces, pp. 357–358. ACM (2014)
- 17. North, C.L., Shneiderman, B.: A Taxonomy of Multiple Window Coordinations. Department of Computer Science, University of Maryland, USA (1997)
- 18. Shneiderman, B.: Designing the User Interface. Strategies for Effective Human-Computer Interaction, 4th edn. Addison-Wesley, Boston (2004)
- 19. Tidwell, J.: Interaction design patterns. In: Conference on Pattern Language Programming (1998)
- 21. Spotfire. http://spotfire.tibco.com/de/. Accessed 12 Feb 2016
- 22. Tableau. http://www.tableau.com