1 Introduction

Collecting and tracking personal data is becoming increasingly popular in various domains, ranging from physical activity (e.g., FitBit) to sleep (e.g., SleepCycle), food and water consumption (e.g., MyFitnessPal), and financial expenses (e.g., Mint) [18]. Tracking personal data can support people in becoming more aware of their habits and behaviors, changing their behavior, or reaching specific goals (such as spending less money or being more physically active) [8]. These systems are often called behavior change technologies (BCT) [25]. An important part of many BCTs is the way in which they present feedback, often delivered through personal visualizations (PVs). One of the main challenges that PVs face is the diverse personalities and contexts that they should cater for [26]. As an example, for some users direct feedback, e.g., on the amount of weight gained or the amount of energy consumed, might induce negative feelings such as guilt or shame, while other users might appreciate the honesty of the technology and feel motivated. Several studies have reported that PVs provoked negative feelings among some users [28, 39]. These negative experiences are named as a reason for the limited success of persuasive technologies, particularly in the long run [6]. One way to address this problem is to adapt the design of PVs to the individual needs and preferences of a user.

Fig. 1.
figure 1

Screenshots of the Donut (left), Creature (middle) and Glass (right) personal data visualization in the liquid-intake tracking application developed for our study.

In this work, we therefore tried to gain a better understanding of how preferences for PVs correlate with stable, well-researched personal traits based on the established Big Five personality theory. We developed three sample visualizations to convey users’ daily liquid intake based on a review of existing liquid intake tracking applications: a plain visualization (Donut), and two decorated visualizations – namely a visualization based on the metaphor of a glass (Glass), and a visualization based on the metaphor of a creature (Creature) as depicted in Fig. 1. We chose liquid intake as an example tracking metric because a broad range of users can relate to this use case. In a lab study, 36 participants were introduced to the three visualizations, conducted a personality test, and rated each visualization with respect to its attractiveness, its motivating or deterring effect, and their interest in its state.

Our hypothesis was that depending on the user’s personality visualizations that make use of metaphors such as Creature and Glass might either be perceived as more engaging or deterring. Our results support this hypothesis. This paper reports the correlations between preferences for visualizations and personality traits and discusses directions for future research.

2 Related Work

2.1 Personal Visualizations (PVs)

Huang et al. [26] defined the field of personal visualizations (PV) as “the design of interactive visual data representations for use in a personal context”. Examples for PVs range from counts, graphs, and tables [11, 18] via stylized displays [12, 30], watchfaces [21] and narrative information [40] to data physicalisations [38] and living plants [5]. However, the suitability of different PVs for different contexts, goals, or users has rarely been systematically explored with a few exceptions: Epstein et al. [18] investigated how well different “visual cuts” allow users to explore their location and physical activity data. The visual cuts in their study were tables, graphs, captures, and maps. Because visualizations that people valued varied dramatically, Epstein et al. [18] recommended using a machine learning approach to offer users more effective or appropriate visualizations.

Choe et al. [7] and Eikey et al. [17] focused on persuasive performance feedback and investigated the influence of several design choices in health applications on self-efficacy. More specifically, Choe et al. [7] varied the valence of feedback (displaying the steps remaining to reach daily goal vs. displaying the achieved steps) and measured users’ self-efficacy. They concluded that the achieved framing positively impacted participants’ self-efficacy. Eikey et al. [17] varied the effect of color and distance to a goal and measured users’ self-efficacy for accomplishing a goal. While they did not find significant results for color choice, they concluded that the remaining distance to a goal influenced users’ self-efficacy.

While these studies revealed interesting results about the design of progress bars and text feedback, our study relates users’ personalities and gender to their preferences for plain (Donut) and decorated visualizations (Glass, and Creature). Several researchers have explored potential advantages of decorative elements in charts, including for example memorability of charts, as examined by both Bateman et al. [4] and Li and Moacdieh [31]. While Bateman [4] concluded that visualizations with decorative elements were more memorable and preferred by most participants, Li [31] presented more ambiguous results: Some people perceived the decorative elements as inefficient, unprofessional and overloaded, while others perceived them as cute, attractive, and interesting. However, we do not yet understand, which users prefer what kind of visualization and in what context.

There is a lack of studies looking at users’ preferences for decorative elements in PVs with respect to their individual differences. However, the idea of adapting visualizations has been applied to other types of visualizations under the name of user-adaptive visualizations.

2.2 User-Adaptive Information Visualization

Several researchers (e.g., [9, 10, 23, 46]) investigated the idea of user-adaptive visualizations, which are visualizations tailored in real-time to the needs and abilities of each user [9]. Existing work in this area has investigated the influence of both cognitive abilities [10, 41] and personality traits [22, 46] on the effectiveness of visualizations. Early results support the hypothesis that adapting visualizations to individual differences increases performance and satisfaction with visualizations significantly. However, existing studies focused on expert visualizations with the goal to extract complex information as quickly as possible. In contrast, the goal of the PVs used in our study is to motivate users to monitor their own data and work towards a goal. As discussed by Huang et al. [26], the nature of PVs is very different from expert visualizations. We are not aware of any studies that investigate how to adapt PVs to users’ personality to motivate them towards reaching a goal.

2.3 The Influence of Personality

One possible source of influence on the perception of different types of data visualizations roots in users’ personality dispositions. In psychological science, the dominant personality theory is the lexically derived Big Five personality trait theory [13, 20]. In this theory personality is described by five broad traits, each of which consists of six sub-facets representing more narrow aspects of the personality. Below, we briefly summarize the Big Five:

People scoring high on extraversion are more sociable, lively, risk-taking, dominant, and in general prefer being in company with others over being alone. People scoring low on this personality trait are usually described as more reserved, reflected, and do not find much pleasure in spending time with many people [3]. People scoring high on emotional stability experience less negative emotions, feel more evenly-tempered, less impulsive, and also less responsive to stress. Low scores in emotional stability correspond to more negative feelings and longer durations of those [2]. Conscientiousness is a personality trait that describes how dutiful, self-disciplined, organized and deliberate someone is [32]. People scoring low on this dimension are typically more laid-back, less urged towards achievement, less goal-oriented, and less driven by success. This personality trait is closely related to academic as well as work place performance, and is therefore often assessed in job interviews [45]. The personality trait of Agreeableness is important for inter-personal relationships and describes how kind, sympathetic as well as helpful and considerate someone is. People with low scores usually are not very concerned about the feelings of people and are less likely to help others generally preferring competition over cooperation [27]. Openness is the personality trait that in general describes how curious someone is towards new experiences, feelings and attitudes. People with low values on this trait are generally described as more traditional, conservative and less flexible in their thinking and behavior. Openness is also correlated with intelligence and creativity [15].

We assume that users’ preference for certain PVs might correlate with certain personality traits and sub-facets

3 Prototype

For our lab study, we chose liquid intake as an example for self-tracking because the recommendation to drink a certain amount of liquid every day is wide-spread and well-known. Moreover, liquid volume is a simple metric to represent.

Instead of developing our own visualizations, we wanted to pick up state-of-the-art visualizations that are already being used in applications today. Hence, we reviewed applications in app stores and found three broader visualization categories: those using plain charts and graphs, those using non-living metaphors (such as glasses and bottles) and those using living metaphors (such as plants and animals). As depicted in Fig. 2, we picked a plain visualization (Donut) and a living metaphor (Creature) as the two ends of a spectrum as we expected that users’ personality might influence their preference for one or another. We added a non-living metaphor (Glass) as a third visualization to see whether it would be perceived similarly to the living metaphor.

Fig. 2.
figure 2

Sketches of the three different concepts for visualizing liquid intake status compared in our study (left: Donut, middle: Creature, right: Glass).

3.1 Donut Chart

Donut is based on PVs often used by commercial tracking software such as FitbitFootnote 1, Jawbone UpFootnote 2, WithingsFootnote 3, or the water consumption tracker “Hydrate Daily”. Before any liquid intake was recorded, the circular bar chart is completely grey. When the user adds predefined amounts of liquid to the counter, the blue bar grows clockwise, filling up the circular bar to represent the user’s progression towards the daily liquid goal (see Fig. 2 left).

3.2 Creature Metaphor

Creature makes use of the metaphor of a living plant. Empathy and compassion can be strong motivators and might therefore be used to foster behavior change [19]. There is evidence that users can emotionally engage with artificial pets and in many cases these emotions have therapeutic effects [34]. Botros et al. [5] used this effect by linking the well-being of a real plant to the physical activity of a user. Related, Creature displays a creature that suffers and is afraid to die of thirst when users themselves do not drink enough water. When a user adds predefined amounts of liquid, the creature becomes happier to represent the user’s progression towards the daily liquid goal (see Fig. 3). Hence, the user is now responsible for the well-being of another virtual being. Several apps in the app store such as “Plant Nanny” use a similar concept.

Fig. 3.
figure 3

Creature visualizes a user’s intake of predefined liquid amounts with its progressing states of happiness.

3.3 Glass Metaphor

To allow the comparison between living and non-living metaphors, we introduced Glass, as shown in Fig. 2 right. It uses the metaphor of a glass, just as several apps in the app stores do (e.g., “Daily Water Free”). Again the water level (similar to the blue bar in Donut) fills up with the percentage of the daily liquid goal that has already been consumed.

4 Method

We ran a mixed between-within-subjects design with a 3 Concepts (Donut vs. Glass vs. Creature) \(\times \)Gender (male vs. female) design and counterbalanced Concepts with a Latin Square for each Gender. We chose a lab setting for this initial exploration (instead of a field study) to reduce the influence of the context of use on our results.

4.1 Evaluation Metrics

As Huang et al. [26] pointed out, the evaluation of PVs presents a new challenge because typical visualization metrics (time and error rate) do not suffice and no best practices exist. They suggest “ease” as a new metric that defines how easily a tool fits into one’s daily life, habits, and routine. “Ease” can, therefore, only be measured in long-term studies, in which participants use the PV in their daily life. Even though this is an important aspect of PV evaluations, we wanted to measure immediate aspects of the user experience that influence whether users feel inclined to use a PV again. Guided by literature on effects of design and information presentation on engagement [24, 35], motivation [14, 38], and self-efficacy [7, 17], we decided to use attractiveness, motivating effect, interest in its state, and deterring effect as dependent variables. To measure the dependent variables, we asked participants to express their agreement to the following statements on 7-point likert scales (statements translated from German):

  • attractiveness: “I perceive the visualization as attractive.”

  • motivating effect: “I feel motivated to drink more water after looking at the visualization.”

  • interest in its state: “I’m interested in the state of the visualization. I would look at it throughout the day.”

  • deterring effect: “I perceive the visualization as deterring.”

In Addition, we included three questions to validate whether all data visualizations and tasks were easy to understand.

4.2 Task

The purpose of the study task was that participants understood the three distinct concepts for visualizing liquid intake goals. Therefore, we asked participants to add predefined amounts of liquid to the liquid intake counter using the buttons provided and to verbalize their thoughts and questions using a think-aloud technique. The researcher who conducted the experiment took notes of all comments and questions. Afterwards, participants were asked whether they understood the concept of the visualizations and any remaining questions were clarified. This procedure took between 3 and 10 min for each visualization.

4.3 Personality Test

We measured participants’ personality traits using the Big Five Personality Inventory (BFSI) [1] in a laboratory setting. Performing the test took 18 min on average. We selected the BFSI for personality assessment due to its short length as well as its favorable psychometric properties.

4.4 Procedure

Our study set-up consisted of seven main steps: (1) informed consent (2) exploring the interactive prototype, (3) individual rating of each concept, (4) enforced ranking of all three concepts, (5) post-questionnaire about smart-phone usage, attitude towards tracking liquid intake, and demographics, and (6) personality test. Finally, (7) participants had the chance to express their opinions verbally, comments were noted down, clustered by researchers and considered in the analysis in addition to quantitative results. We used a unique identifier to link the different anonymized data types from questionnaires and personality test.

4.5 Participants

We recruited 36 paid participants (18 female) through university mailing lists and social networks. Most participants were bachelor, master and PhD students with 44% studying Computer Science or similar subjects. Almost all participants (94%) were smart-phone users with 45% having used smart-phones for at least three years. The majority of participants (72,2%) had “tried to drink a certain amount of liquid every day before” confirming the relevance of the chosen scenario.

4.6 Data Analysis

To examine all possible relationships between personality, gender and preferences, we took an exploratory approach in our study design and data analysis. Due to the exploratory nature of our study, we report 95% confidence intervals (CI) instead of p-values [36]. Confidence intervals not only provide information about the estimated range of the true values in the population (e.g., with repeated measures the CI would contain the correlation value of the population in 95% of the cases), but they also inform about significance in hypothesis testing research (not significant if the CI contains zero). In other words, CIs inform about the precision of the obtained estimator (narrow is good, wide is bad). These values inform hypotheses to be tested in future studies and should not readily be interpreted as established facts. Since we measured preferences for visualizations in ordinal scores, we used Spearman correlations to analyse correlations between ratings and personality dimensions [44]. Since scores on personality dimensions were normally distributed for both genders in our sample, we used Pearson product-moment correlation for correlations between predictor variables (see Fig. 4).

4.7 Limitations

Our study design has the following limitations: (1) Personality is a very complex phenomenon to study. Hence, it needs more than one study to build up an understanding of how individual differences are connected with preferences for visualizations. This study, therefore, can only present a starting point. As the study took on average 45 min to complete, we had to limit the number of PVs to three and restrict the amount of participants (N = 36). Due to these restrictions, correlations have to be interpreted with caution. More example PVs need to be tested to be able to generalize our results. However, given that many correlation coefficients feature medium effect sizes (above a threshold of .30) and given that many correlations are in line with psychological literature, we believe that the results provide first insights into the relationships between personality, gender and preferences for PVs.

(2) Arguably, the scales, underlying the visualizations we used in our study, differ in their nature: Donut and Glass are based on a donut and a bar chart and can therefore easily be used as linear scales. Using Creature as linear scale is more difficult as value differences can not be clearly mapped to pixels: small differences in the gesture and facial expression of the creature might have a smaller or bigger effect on its expression and are open to interpretation by their users. To avoid issues with just noticeable differences we implemented the three visualizations as ordinal scales in our study (users could only add predefined amounts of liquid with provided buttons and those amounts were mapped to distinct states). Although the focus of our study was to compare three general visualization ideas and not on how these visualizations precisely progress, we conducted a prestudy to confirm that the different states of Creature were readable and clearly ordered in the eye of potential users. Therefore, we are confident that the differences of underlying scales did not affect our results. Nevertheless, readers should be aware of the potentially different perceptions of the scales underlying our visualizations.

(3) In line with Eikey et al.’s [17] approach, we measured participants’ perceptions via self-report as this research is in a too early state to measure actual behavior. As we measured perceived attractiveness and interest in the state of the visualizations, we think the results of our study could inform a broader set of personal visualization systems and therefore have an impact beyond behavior change. We chose a controlled lab experiment to ensure the validity of the personality test and to minimize the influence of other contextual factors. However, whether long-term preferences differ (or not) needs to be tested in a long-term study. This will also allow to verify effectiveness towards behavior change, the main goal PVs for behavior change.

5 Results

We will first describe correlations between predictor variables and overall ratings of PVs in descriptive statistics. Next, we will describe correlations between preferences and personality traits.

5.1 Descriptive Statistics

Personality and Gender. For completeness, we report correlations between predictor variables. There were no significant correlations between gender and the Big Five personality factors. However, there were correlations between several Big Five personality factors (as shown in Fig. 4). The highest correlation was observed between extraversion and emotional stability (\(\rho \) = 0.639, p \(< 0.001\)). We calculated variance inflation factors (VIF) for both extraversion (VIF = 1.5) and emotional stability (VIF = 2.1). As the VIF was smaller than 4 in both cases, we proceeded with the analysis [16].

Fig. 4.
figure 4

Pairwise Pearson correlations between the predictor variables, namely, the Big Five measures and gender. Negative values mean a negative correlation, positive values a positive correlation. Gender is encoded as 0:=male, 1:=female.

Fig. 5.
figure 5

Spearman’s rank correlations: Participants rated the three concepts with regard to the criteria on the left on a seven-point Likert scale where 1:=“I absolutely disagree”, 7:=“I absolutely agree”. The values of the Big Five dimensions were measured with the BFSI, gender is encoded as 0:=male, 1:=female.

Overall Ratings of PVs. All three concepts received ratings of four or higher on attractiveness by a majority of participants (Donut: 75%, Glass: 61.1%, Creature: 75%). Creature was most often perceived as motivating (Donut: 44.4%, Glass: 52.8%, Creature: 61.1%). At the same time, the motivating effect of Creature also seemed to be more controversial (Donut: \(M\)=\(4.17\), \(SD\)=\(1.46\), Glass: \(M\)=\(4.58\), \(SD\)=\(1.40\), Creature: \(M\)=\(4.67\), \(SD\)=\(1.97\)). Moreover, more women perceived Creature as motivating than men (w: 72,2%, m: 50,0%). The percentage of people interested in the PV’s state was highest for Glass (Donut: 55.6%, Glass: 72.2%, Creature: 58.3%). An equal number of men and women was interested in the state of Glass (72%). All three concepts were perceived as deterring by only one participant: Donut and Glass by a female, and Creature by a male participant.

5.2 Correlations Between Preferences and Personality Traits

Correlations between Likert scale ratings and the five main BFSI personality dimensions are visualized in Fig. 5. Correlations with subdimensions are shown in Fig. 6.

Fig. 6.
figure 6

Spearman’s rank correlations: Participants rated the three concepts with regard to the criteria on the left on a seven-point Likert scale where 1:=“I absolutely disagree”, 7:=“I absolutely agree”. The values of the subdimensions of the Big Five dimensions were measured with the BFSI, gender is encoded as 0:=male, 1:=female.

We report correlations with a coefficient bigger than 0.2, respectively smaller than −0.2, in the text. Below, we first describe correlations between personality dimensions and Creature and Donut as these PVs were chosen as two ends of a spectrum. Then, we describe correlations with Donut, a PV that we added to contrast Creature with a non-living metaphor. Finally, we describe correlations with gender.

Creature Metaphor. High scores in openness, extraversion, agreeableness, and emotional stability positively correlated with perceptions of Creature as attractive and motivating while high scores in conscientiousness correlated with perceiving Creature as deterring. More details on correlations between ratings of Creature and the main and subdimensions of personality, including 95% confidence intervals, are displayed in Fig. 7.

Fig. 7.
figure 7

Displaying the 95% confidence intervals of spearman correlations between ratings of Creature and personality dimensions

Donut Chart. Correlations between personality and ratings of Donut were generally very small. Only openness to aesthetics, a subdimension of openness, correlated with higher ratings of Donut as attractive.

Glass Metaphor. Higher scores on agreeableness correlated with higher interest in the state of Glass, and with higher ratings of Glass as attractive and as motivating. Lower scores in emotional stability correlated with higher ratings of Glass as deterring. More details on correlations between ratings of Glass and the main and subdimensions of personality, including 95% confidence intervals, are displayed in Fig. 8.

Fig. 8.
figure 8

Displaying the 95% confidence intervals of spearman correlations between ratings of Glass and personality dimensions

5.3 Correlations with Gender

Both women and men perceived Donut and Glass similarly. However, women rated Creature more often as attractive, motivating, and were more often interested in its state than men and than in the states of Glass and Donut.

5.4 Qualitative Comments

From qualitative comments and discussions in our study several alternative themes and metaphors for PVs emerged. Themes suggested by participants included rivers that drain or flood, ships that go aground or sail through heavy swell, and illustrations telling the story of physiological processes related to liquid intake. Moreover, some users in our study mentioned that they would prefer a visualization where the amount of liquid they consumed was subtracted of the daily goal rather than added up (remaining framing). However, as we did not collect preferences on the valence of feedback by all participants, we were not able to investigate if preferences coincide with users’ personality or gender.

6 Discussion and Future Work

We see three main takeaways of our study: (1) It seems likely that preference and aversion for PVs coincide with users’ personalities as measured with the Big Five personality questionnaire. (2) To create PVs that are as engaging as possible, they need to be adapted to suit users preferences. This might be achieved in several ways. (3) Finally, our work provides guidance on how personalized PVs can be evaluated in future research.

6.1 Personality and Preferences for PVs

In our study, especially the personality traits openness (openness to new ideas, aesthetics, and openness of value system), extraversion (love of adventure, friendliness, and joyfulness), and agreeableness correlated with positive ratings of Creature. This observation is in line with previous research, that associated these personality traits with a need for affect [33]. People with a high need for affect tend to choose, for example, emotional movies or to become involved in an emotion-inducing event [33]. Hence, it seems plausible that related personality traits correlate with a preference for PVs, which make use of decorative elements and metaphors. Moreover, we found a tendency that conscientious people (described as dutiful, organized, self-disciplined, and deliberate) found Creature more deterring. In previous research conscientiousness was negatively associated with internet use [29] and playing videogames [43]. Hence, it seems plausible that more conscientious people are less drawn towards entertaining and decorating elements and prefer clean and plain visualizations. Correlations between perceptions of Glass and personality traits were somewhat similar but less and weaker than correlations with perceptions of Creature. Moreover, in our study, women tended to rate Creature more positively than Donut and Glass. These findings bring a new perspective to discussions of the usefulness of decorative elements in charts. It seems likely that such decorative elements can make PVs more engaging especially for users with a certain personality. However, future research is necessary validate these findings with more users and to explore a wider set of visualizations.

6.2 Design Space of Personalized PVs

We limited the set of PVs in our study to three, in order to keep the length of the experiment reasonable. These visualizations are examples taken from a rich design space. This design space encompasses, for example, different themes and metaphors not limited to plants and animals. Even for the relatively simple example of liquid intake visualizations, a variety of themes and metaphors emerged during our study in qualitative comments and discussions with participants. Future work should further analyze and explore this design space, e.g., in participatory design sessions and investigate how preferences are connected with users’ individual differences.

6.3 Evaluation of Personalized PVs

To explore the design space of PVs and the suitability of visualizations for different users and user groups, it is necessary to define evaluation criteria. Established evaluation metrics for visualizations - such as time and error rate - are, however, unsuitable for PVs. Rather than efficiency, the user experience is more likely to influence whether the PV is adopted and used over a longer period of time. In this study, we therefore used attractiveness, motivating effect, deterring effect, and interest in its state as evaluation metrics. However, even though these metrics allowed us to gain a first understanding of users’ perceptions, they are tentative and most certainly need to be rethought in future studies. Beyond these, there are likely other relevant metrics, e.g., related to user experience or retention. If PVs are part of BCT actual behavior change is an important metric to measure in the long term. Future work should expand and explore the set of evaluation techniques used for PVs including the use of PVs in context and in the long term.

6.4 Adaptive vs. Adaptable PVs

Once a better understanding of the connection between users’ individual differences and preferences for PVs is established, an important question to explore is how PVs could be adapted to better suit users’ individual needs. As for all personalized systems, two general ways to achieve this personalization are to allow users to explicitly choose preferred visualizations (e.g., with a flexible framework that helps people design visualizations for themselves, as suggested by Huang et al. [26] or automatic adaptation. Automatic adaptation requires a user model that includes relevant characteristics of the user - in our case personality traits. These parameters might for example be inferred from mobile phone usage behaviors. Other trait-like factors such as cooperation have already been successfully predicted [37]. As data sets that contain both smartphone usage and personality data have already been collected from over 40.000 users, predicting personality traits based on mobile phone usage in the near future seems feasable [42]. However, some questions need to be answered before automatic adaptation becomes an option: When and how is a visualization adapted? How can adaptations be suggested without annoying or irritating the user?

7 Conclusion

There are many ways to visualize a user’s personal data. PVs can be plain and direct like counts and graphs or more abstract like stylized displays or data physicalizations. Previous research showed that users seem to perceive different kinds of PVs very differently. To shed light on potential relationships between personality, gender, and preferences for PVs, we conducted an exploratory lab study with three example PVs. According to our results, preferences for PVs do indeed coincide with personality: For example, participants who scored high on openness and extraversion were more likely to prefer the PV with a living metaphor, while participants who scored high on conscientiousness rated it more negatively. Our results are in line with previous psychological research. Therefore, we are confident that our results provide promising insights into the relationships between personality and preferences for PVs. However, our research can not be more than a starting point at this stage and further validation of our findings is needed.